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ANGIOGENISIS ASSOCIATES PROTEINS, AND NUCLEIC ACIDS 



ENCODING THE SAME 



Related Applications 

This application claims priority to U.S. provisional application Serial No. 
60/191,134 filed 03/22/2000, which is incorporated herein by reference in its entirety. 

Background 

Cities have roads and alleys, plants have xylem and phloem, and people have 
arteries, veins and lymphatics. Without these byways, the vertebrate animal cells would 
starve or drown in their metabolic refuse. Not only do blood vessels deliver food and 
oxygen and carry away metabolic wastes, but they also transport signaling substances that 
apprise cells of situations remote to them but to which they need to respond. Hormonal 
messages are a common signal. 

All blood vessels are ensheathed by a basal lamina and a delicate monolayer of 
remarkably plastic endothelial cells lining the luminal walls. Depending on location and 
function, smooth muscle and connective tissue may also be present. 

Not only do healthy cells depend on the blood resources transported by the 
circulatory system, but so, too, unwanted cells: tumorigenic and malignant cells. These 
cells colonize and proliferate if they are able to divert blood resources to themselves. 
Angiogenesis, the type of blood vessel formation where new vessels emerge from the 
proliferation of preexisting vessels (Risau, 1995; Risau and Flamme, 1995), is exploited 
not only by usual processes, such as in wound healing or myocardial infarction repair, but 
also by tumors themselves and in cancers, diabetic retinopathy, macular degeneration, 
psoriasis, and rheumatoid arthritis. Regardless of the process, whether pathological or 
usual physiological, endothelial cells mediate angiogenesis in a multi-step fashion: (1) 
endothelia receive an extracellular cue, (2) the signaled cells breach the basal lamina 
sheath, abetted by proteases they secrete, (3) the cells then migrate to the signal and 
proliferate, and finally, (4) the cells form a tube, a morphogenic event (Alberts et aL, 
1994). The complexity of this process indicates complex changes in cellular physiology 
and morphology, gene expression, and signaling. Angiogenic accomplices that are cues 



include basic fibroblast growth factors (bFGF), angiopoietins (such as ANG1) and 
various forms of vascular endothelial growth factor (VEGF). 

The molecular events and the order in which they occur and the pathways that are 
required for this process are of fundamental importance to understand angiogenesis. In 
vitro models are useful for identifying alterations in gene expression that occur during 
angiogenesis. A particularly fruitful model systems involves the supspension in a three- 
dimensional type I collagen gel and various stimuli, such as phorbol myristate acetate 
(PMA), basic fibroblast growth factor (bFGF), and VEGF. The combination of the 
stimuli and the collagen gel results in the formation of a three-dimensional tubular 
network of endothelial cells with interconnecting lumenal structures. In this model, 
endothelial differentiation into tubelike structures is completely blocked by inhibitors of 
new mRNA or protein synthesis. Furthermore, the cells progress through differentiation 
in a coordinated and synchronized manner, thus optimizing the profile of gene expression 
(Kahn et al. y 2000; Yang et a/., 1999). 

Tumor cells exploit angiogenesis to facilitate tumor growth. Controlling 
angiogenesis, by controlling the activity or expression of genes and proteins associated 
with angiogenesis, provides a way to prevent tumor growth, or even destoy tumors. 

Summary 

The invention is based in part upon the discovery of novel nucleic acid sequences 
encoding novel polypeptides. Nucleic acids encoding the polypeptides disclosed in the 
invention, and derivatives and fragments thereof, will hereinafter be collectively 
designated as "AAP" nucleic acid or polypeptide sequences. AAP, or angiogenesis 
associated polypeptides (AAP) comprises kelch-like polypeptide (KLP), human ortholog 
of mouse B AZF (hBAZF), hmt-elongation factor G (hEF-G), human ortholog of rat TRG 
(hTRG), human myosin X (hMXl) and its splice variant (hMX2), nuclear hormone 
receptor (NHR), and human mitochondrial protein (hMP). 

The invention is based in part upon the discovery of novel nucleic acid sequences 
encoding novel polypeptides. Nucleic acids encoding the polypeptides disclosed in the 
invention, and derivatives and fragments thereof, will hereinafter be collectively 
designated as "AAP" nucleic acid or polypeptide sequences." 
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In a first aspect, the present invention is an isolated polypeptide having at least 
80% sequence identity to the sequence SEQ ID NOS:2, 4, 6, 8, 10, 12, 14 or 16, 
polynucleotides encoding the same, and antibodies that specifically bind the same. 

In a second aspect, the present invention is an isolated polynucleotide having at 
least 80% sequence identity to the sequence SEQ ID NOS:l, 3, 5, 7, 9, 1 1, 13 or 15, or a 
complement thereof. 

In a third aspect, the present invention is a transgenic non-human animal, having a 
disrupted AAP gene or a transgenic non-human animal expressing an exogenous 
polynucleotide having at least 80% sequence identity to the sequence SEQ ID NOS:l, 3, 
5, 7, 9, 11, 13 or 15, or a complement of said polynucleotide. 

In a fourth aspect, the present invention is a method of screening a sample for a 
mutation in an AAP gene. 

In a fifth aspect, the present invention is a method of modulating angiogenesis 
comprising modulating the activity of at least one AAP polypeptide. 

In a sixth aspect, the present invention is a method of increasing, as well as 
decreasing angiogenesis, comprising modulating the activity of at least one AAP 
polypeptide. Activity modulation of AAP polypeptides may be over-expressing or 
eliminating expression of the gene, or impairing a AAP polypeptide's function by contact 
with specific antagonists or agonists, such as antibodies or aptamers. 

In a seventh aspect, the present invention is a method of treating various 
pathologies, including tumors, cancers, myocardial infarctions and the like. 

In an eighth aspect, the present invention is a method of measuring a AAP 
transcriptional and translational up-regulation or down-regulation activity of a compound. 

In a ninth aspect, the invention is a method of screening a tissue sample for 
tumori genie potential. 

In a tenth aspect, the invention is a method of determining the clinical stage of 
tumor which compares the expression of at least one AAP in a sample with expression of 
said at least one gene in control samples. 

Although methods and materials similar or equivalent to those described herein 
can be used in the practice or testing of the present invention, suitable methods and 
materials are described below. All publications, patent applications, patents, and other 



references mentioned herein are incorporated by reference in their entirety. In the case of 
conflict, the present specification, including definitions, will control. In addition, the 
materials, methods, and examples are illustrative only and not intended to be limiting. 

Detailed description 

A model of angiogenesis-the suspension of endothelial cells in type I collagen gels 
with various stimuli-was used to identify a molecular fingerprint or transcriptional profile 
of endothelial differentiation into tubelike structures, using amplification and an imaging 
approach called GeneCalling (Shimkets et al., 1999). This method was previously shown 
to provide a comprehensive sampling of cDNA populations in conjunction with the 
sensitive detection of quantitative differences in mRNA abundance for both known and 
novel genes. Many differentially expressed cDNA fragments were found. The 
identification and differential expression of these gens was confirmed by a second 
independent method employing real-time quantitative polymerase chain reaction (PCR). 
Although some of the identified cDNA fragments were genes known to play some role in 
angiogenesis, many other differentially expressed genes were unexpected. The inventors 
have identified the unexpected genes and polypeptides that are expressed in response to 
this model of angiogenesis, collectively refered to as angiogenesis associated 
polypeptides (AAP). AAP are kelch-like polypeptide (KLP), human ortholog of mouse 
BAZF (hBAZF), hmt-elongation factor G (hEF-G), human ortholog of rat TRG (hTRG), 
human myosin X (hMXl) and its splice variant (hMX2), nuclear hormone receptor 
(NHR), and human mitochondrial protein (hMP). 

Definitions 

Unless defined otherwise, all technical and scientific terms have the same 
meaning as is commonly understood by one of skill in the art to which this invention 
belongs. The definitions below are presented for clarity. All patents and publications 
referred to herein are, unless noted otherwise, incorporated by reference in their entirety. 

The recommendations of (Demerec et al. 9 1966) where these are relevant to 
genetics are adapted herein. To distinguish between genes (and related nucleic acids) and 
the proteins that they encode, the abbreviations for genes are indicated by italicized (or 



underlined) text while abbreviations for the proteins start with a capital letter and are not 
italicized. Thus, AAP or AAP refers to the nucleotide sequence that encodes AAP. 

"Isolated," when referred to a molecule, refers to a molecule that has been 
identified and separated and/or recovered from a component of its natural environment. 
Contaminant components of its natural environment are materials that interfere with 
diagnostic or therapeutic use. 

"Container" is used broadly to mean any receptacle for holding material or 
reagent. Containers may be fabricated of glass, plastic, ceramic, metal, or any other 
material that can hold reagents. Acceptable materials will not react adversely with the 
contents. 

1 . Nucleic acid-related definitions 

(a) control sequences 

Control sequences are DNA sequences that enable the expression of an operably- 
linked coding sequence in a particular host organism. Prokaryotic control sequences 
include promoters, operator sequences, and ribosome binding sites. Eukaryotic cells 
utilize promoters, polyadenylation signals, and enhancers. 

(b) operably-linked 

Nucleic acid is operably-linked when it is placed into a functional relationship 
with another nucleic acid sequence. For example, a promoter or enhancer is operably- 
linked to a coding sequence if it affects the transcription of the sequence, or a ribosome- 
binding site is operably-linked to a coding sequence if positioned to facilitate translation. 
Generally, "operably-linked" means that the DNA sequences being linked are contiguous, 
and, in the case of a secretory leader, contiguous and in reading phase. However, 
enhancers do not have to be contiguous. Linking is accomplished by conventional 
recombinant DNA methods. 

(c) isolated nucleic acids 

An isolated nucleic acid molecule is purified from the setting in which it is found 
in nature and is separated from at least one contaminant nucleic acid molecule. Isolated 
AAP molecules are distinguished from the specific AAP molecule, as it exists in cells. 
However, an isolated AAP molecule includes AAP molecules contained in cells that 



ordinarily express an AAP where, for example, the nucleic acid molecule is i 
chromosomal location different from that of natural cells. 



2. Protein-related definitions 

(a) purified polypeptide 

When the molecule is a purified polypeptide, the polypeptide will be purified (1) 
to obtain at least 15 residues of N-terminal or internal amino acid sequence using a 
sequenator, or (2) to homogeneity by SDS-PAGE under non-reducing or reducing 
conditions using Coomassie blue or silver stain. Isolated polypeptides include those 
expressed heterologously in genetically-engineered cells or expressed in vitro, since at 
least one component of an AAP natural environment will not be present. Ordinarily, 
isolated polypeptides are prepared by at least one purification step. 

(b) active polypeptide 

An active AAP or AAP fragment retains a biological and/or an immunological 
activity of the native or naturally-occurring AAP. Immunological activity refers to the 
ability to induce the production of an antibody against an antigenic epitope possessed by a 
native AAP; biological activity refers to a function, either inhibitory or stimulatory, 
caused by a native AAP that excludes immunological activity. A biological activity of 
AAP includes, for example, modulating angiogenesis. 

(c) Abs 

Antibody may be single anti-AAP monoclonal Abs (including agonist, antagonist, 
and neutralizing Abs), anti-AAP antibody compositions with polyepitopic specificity, 
single chain anti-AAP Abs, and fragments of anti-AAP Abs. A "monoclonal antibody- 
refers toan antibody obtained from a population of substantially homogeneous Abs, i.e., 
the individual Abs comprising the population are identical except for naturally-occurring 
mutations that may be present in minor amounts 

(d) epitope tags 

An epitope tagged polypeptide refers to a chimeric polypeptide fused to a "tag 
polypeptide". Such tags provide epitopes against which Abs can be made or are 
available, but do not interfere with polypeptide activity. To reduce anti-tag antibody 
reactivity with endogenous epitopes, the tag polypeptide is preferably unique. Suitable tag 
polypeptides generally have at least six amino acid residues and usually between about 8 



and 50 amino acid residues, preferably between 8 and 20 amino acid residues). Examples 
of epitope tag sequences include HA from Influenza A virus and FLAG. 

The novel AAP of the invention include the nucleic acids whose sequences are 
provided in Tables 1, 3, 5, 7, 9, 11, 13 and 15, or a fragment thereof. The invention also 
includes a mutant or variant AAP, any of whose bases may be changed from the 
corresponding base shown in Tables 1, 3, 5, 7, 9, 11, 13 and 15 while still encoding a 
protein that maintains the activities and physiological functions of the AAP fragment, or a 
fragment of such a nucleic acid. The invention further includes nucleic acids whose 
sequences are complementary to those just described, including complementary nucleic 
acid fragments. The invention additionally includes nucleic acids or nucleic acid 
fragments, or complements thereto, whose structures include chemical modifications. 
Such modifications include, by way of nonlimiting example, modified bases, and nucleic 
acids whose sugar phosphate backbones are modified or derivatized. These modifications 
are carried out at least in part to enhance the chemical stability of the modified nucleic 
acid, such that they may be used, for example, as anti-sense binding nucleic acids in 
therapeutic applications in a subject. In the mutant or variant nucleic acids, and their 
complements, up to 20% or more of the bases may be so changed. 

The novel AAP of the invention include the protein fragments whose sequences 
are provided in Tables 2, 4, 6, 8, 10, 12, 14 and 16. The invention also includes an AAP 
mutant or variant protein, any of whose residues may be changed from the corresponding 
residue shown in Tables 2, 4, 6, 8, 10, 12, 14 and 16 while still encoding a protein that 
maintains its native activities and physiological functions, or a functional fragment 
thereof. In the mutant or variant AAP, up to 20% or more of the residues may be so 
changed The invention further encompasses Abs and antibody fragments, such as F ab or 
(Fab)'2, that bind immunospecifically to any of the AAP of the invention. 

The AAP nucleic acids are shown in Tables 1, 3, 5, 7, 9, 1 1, 13 and 15, and the 
corresponding polypeptides are shown in Tables 2, 4, 6, 8, 10, 12, 14 and 16, respectivly. 
Start and stop codons in the polynucleotide sequences are indicated in boldface and with 
underlining. SEQ ID NO:3 lacks a stop codon. The sequences of hMXl and hMX2 do 
not have start codons (see Table 17); consequently, hMXl and hMX2 polypeptides do not 
start with a Met. For any lacking polynucleotide sequence, one of skill in the art may 
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retrieve the full length sequence by, for example, probing cDNA or genomic libraries 
with probes designed according to the sequences of the instant invention. 

Table 1 KLP nucleotide sequence (SEQ ID NO: 1) 

ctggcctaga tactacaact gaactttttt tctttttagt tactccacag gatccgctga 60 

acataggatg ttgccacaaa atctacctcg tgtatttttc tctttcactc atgagctgca 120 

caattgcaga tttgagcaca atgtctgcag actgtgttga aaaactctga agaacctaat 180 

taacacagga tgacctagga gtgattctaa gtctgtgtaa caagatatta ctcattagtg 240 

aatgtgtcag tcttggtact gaatgctgca gataacagca agtaggttct cctttatttc 300 

tgaagtattc acttgacctt ccatcagtaa gacggacttt tctaatctgt tcctggagat 360 

attaatggaa tacaatc atg tccactcaag acgagaggca gatcaatact gaatatgctg 420 

tgtcattgtt ggaacagttg aaactgtttt atgaacagca gttgtttact gacatagtgt 480 

taattgttga gggcactgaa ttcccttgtc ataagatggt tcttgcaaca tgtagctctt 540 

atttcagggc catgtttatg agtggactaa gtgaaagcaa acaaacccat gtacacctga 600 

ggaatgtcga tgctgccacc ttacagataa taataactta tgcatacacg ggtaacttgg 660 

caatgaatga cagcactgta gaacagcttt atgaaacagc ttgcttccta caggtagaag 720 

atgtgttaca acgttgtcga gaatatttaa ttaaaaaaat aaatgcagag aattgtgtac 780 

gattgttgag ttttgctgat ctcttcagtt gtgaggaatt aaaacagagt gctaaaagaa 840 

tggtggagca caagttcact gctgtgtatc atcaggacgc gttcatgcag ctgtcacatg 900 

acctactgat agatattctc agtagtgaca atttaaatgt agaaaaggaa gaaaccgttc 960 

gagaagctgc tatgctgtgg ctagagtata acacagaatc acgatcccag tatttgtctt 1020 

ctgttcttag ccaaatcaga attgatgcac tttcagaagt aacacagaga gcttggtttc 1080 

aaggtctgcc acccaatgat aagtcagtgg tggttcaagg tctgtataag tccatgccca 1140 

agtttttcaa accaagactt gggatgacta aagaggaaat gatgattttc attgaagcat 1200 

cttcagaaaa tccttgtagt ctttactctt ctgtctgtta cagcccccaa gcagaaaaag 1260 

tttacaagtt atgtagccca ccagctgatt tgcataaggt tgggaccgtt gtaactcctg 1320 

ataatgatat ctacatagca gggggtcaag ttcctctgaa aaacacaaaa acaaatcaca 1380 

gtaaaacaag caaacttcag actgccttca gaactgtgaa ttgcttttat tggtttgatg 1440 

cacagcaaaa tacctggttt ccaaagaccc caatgctttt tgtccgcata aagccatctt 1500 

tggtttgctg tgaaggctat atctatgcaa ttggaggaga tagcgtaggt ggagaactta 1560 

atcggaggac cgtagaaaga tacgacactg agaaagatga gtggacgatg gtaagccctt 162 0 

taccttgtgc ttggcaatgg agtgcagcag ttgtggttca tgactgcatt tatgtgatga 1680 

cactgaacct catgtactgt tattttccaa ggtctgactc atgggtagaa atggccatga 1740 

gacagactag taggtccttt gcttcagctg cagcttttgg tgataaaatt ttctatattg 1800 

gagggttgca tattgctacc aattccggca taagagtccc ctctggcact gtagatgggt 1860 

cttcagtaac tgtggaaatt tatgatgtga ataaaaatga gtggaaaatg gcagccaaca 1920 

tccctgctaa gaggtactct gacccctgtg ttagagctgt tgtgatctca aattctctat 1980 

gtgtgtttat gcgagaaacc cacttaaatg agcgagctaa atacgtcacc taccaatatg 2040 

acctggaact tgaccggtgg tctctgcggc agcatatatc tgaacgtgta ctgtgggact 2100 

tggggagaga ttttcgatgc actgtgggga aactctatcc atcctgcctt gaagagtctc 2160 

catggaaacc accaacttat cttttttcaa cggatgggac agaagagttt gaactggatg 222 0 

gagaaatggt tgcactacca cctata tag t ggggaagttc agggagtgca cgcctgagtt 2280 
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atgtgctttg 


tcattttctt 


tgctaaacaa 


aagaggctat 


gaaagaacta 


aatatgagta 


2340 


cataaaattc 


tatctttgat 


aaattttatt 


tttatgccct 


acttaatatt 


tgcatcagta 


2400 


taatatatat 


cagtgagtct 


tacagaaaga 


tatgcttcca 


taatatgaaa 


tagattattc 


2460 


aataattgag 


aaactttatg 


tgtaatcatg 


agagtataag 


aatctggatt 


atctaacatt 


2520 


gttagccctg 


tgtatgtaca 


gttcaaaaag 


ttcatttata 


aaagtagttt 


cctgttc 


2577 



Table 2 KLP polypeptide sequence (SEQ ID NO:2) 



Met 
1 


Ser 


Thr 


Gin 


Asp 
5 


Glu 


Arg 


Gin 


He 


Asn 
10 


Thr 


Glu 


Tyr 


Ala 


Val 
15 


Ser 


Leu 


Leu 


Glu 


Gin 
20 


Leu 


Lys 


Leu 


Phe 


Tyr 
25 


Glu 


Gin 


Gin 


Leu 


Phe 
30 


Thr 


Asp 


He 


Val 


Leu 
35 


He 


Val 


Glu 


Gly 


Thr 
40 


Glu 


Phe 


Pro 


Cys 


His 
45 


Lys 


Met 


Val 


Leu 


Ala 
50 


Thr 


Cys 


Ser 


Ser 


Tyr 
55 


Phe 


Arg 


Ala 


Met 


Phe 
60 


Met 


Ser 


Gly 


Leu 


Ser 
65 


Glu 


Ser 


Lys 


Gin 


Thr 
70 


His 


Val 


His 


Leu 


Arg 
75 


Asn 


Val 


Asp 


Ala 


Ala 
80 


Thr 


Leu 


Gin 


He 


He 
85 


He 


Thr 


Tyr 


Ala 


Tyr 
90 


Thr 


Gly 


Asn 


Leu 


Ala 
95 


Met 


Asn 


Asp 


Ser 


Thr 
100 


Val 


Glu 


Gin 


Leu 


Tyr 
105 


Glu 


Thr 


Ala 


Cys 


Phe 
110 


Leu 


Gin 


Val 


Glu 


Asp 
115 


Val 


Leu 


Gin 


Arg 


Cys 
120 


Arg 


Glu 


Tyr 


Leu 


He 
125 


Lys 


Lys 


He 


Asn 


Ala 
130 


Glu 


Asn 


Cys 


Val 


Arg 
135 


Leu 


Leu 


Ser 


Phe 


Ala 
140 


Asp 


Leu 


Phe 


Ser 


Cys 
145 


Glu 


Glu 


Leu 


Lys 


Gin 
150 


Ser 


Ala 


Lys 


Arg 


Met 
155 


Val 


Glu 


His 


Lys 


Phe 
160 


Thr 


Ala 


Val 


Tyr 


His 
165 


Gin 


Asp 


Ala 


Phe 


Met 
170 


Gin 


Leu 


Ser 


His 


Asp 
175 


Leu 


Leu 


He 


Asp 


He 
180 


Leu 


Ser 


Ser 


Asp 


Asn 
185 


Leu 


Asn 


Val 


Glu 


Lys 
190 


Glu 


Glu 


Thr 


Val 


Arg 
195 


Glu 


Ala 


Ala 


Met 


Leu 
200 


Trp 


Leu 


Glu 


Tyr 


Asn 
205 


Thr 


Glu 


Ser 


Arg 


Ser 
210 


Gin 


Tyr 


Leu 


Ser 


Ser 
215 


Val 


Leu 


Ser 


Gin 


He 
220 


Arg 


He 


Asp 


Ala 


Leu 
225 


Ser 


Glu 


Val 


Thr 


Gin 
230 


Arg 


Ala 


Trp 


Phe 


Gin 
235 


Gly 


Leu 


Pro 


Pro 


Asn 
240 


Asp 


Lys 


Ser 


Val 


Val 
245 


Val 


Gin 


Gly 


Leu 


Tyr 
250 


Lys 


Ser 


Met 


Pro 


Lys 
255 


Phe 


Phe 


Lys 


Pro 


Arg 
260 


Leu 


Gly 


Met 


Thr 


Lys 
265 


Glu 


Glu 


Met 


Met 


He 
270 


Phe 


He 
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Glu 


Ala 


Ser 
275 


Ser 


Glu 


Asn 


Pro 


Cys 
280 


Ser 


Leu 


Tyr 


Ser 


Ser 
285 


Val 


Cys 


Tyr 


Ser 


Pro 
290 


Gin 


Ala 


Glu 


Lys 


Val 
295 


Tyr 


Lys 


Leu 


Cys 


Ser 
300 


Pro 


Pro 


Ala 


Asp 


Leu 
305 


His 


Lys 


Val 


Gly 


Thr 
310 


Val 


Val 


Thr 


Pro 


Asp 
315 


Asn 


Asp 


He 


Tyr 


He 
320 


Ala 


Gly 


Gly 


Gin 


Val 
325 


Pro 


Leu 


Lys 


Asn 


Thr 
330 


Lys 


Thr 


Asn 


His 


Ser 
335 


Lys 


Thr 


Ser 


Lys 


Leu 
340 


Gin 


Thr 


Ala 


Phe 


Arg 
345 


Thr 


Val 


Asn 


Cys 


Phe 
350 


Tyr 


Trp 


Phe 


Asp 


Ala 
355 


Gin 


Gin 


Asn 


Thr 


Trp 
360 


Phe 


Pro 


Lys 


Thr 


Pro 
365 


Met 


Leu 


Phe 


Val 


Arg 
370 


He 


Lys 


Pro 


Ser 


Leu 
375 


Val 


Cys 


Cys 


Glu 


Gly 
380 


Tyr 


He 


Tyr 


Ala 


He 
385 


Gly 


Gly 


Asp 


Ser 


Val 
390 


Gly 


Gly 


Glu 


Leu 


Asn 
395 


Arg 


Arg 


Thr 


Val 


Glu 
400 


Arg 


Tyr 


Asp 


Thr 


Glu 
405 


Lys 


Asp 


Glu 


Trp 


Thr 
410 


Met 


Val 


Ser 


Pro 


Leu 
415 


Pro 


Cys 


Ala 


Trp 


Gin 
420 


Trp 


Ser 


Ala 


Ala 


Val 
425 


Val 


Val 


His 


Asp 


Cys 
430 


He 


Tyr 


Val 


Met 


Thr 
435 


Leu 


Asn 


Leu 


Met 


Tyr 
440 


Cys 


Tyr 


Phe 


Pro 


Arg 
445 


Ser 


Asp 


Ser 


Trp 


Val 
450 


Glu 


Met 


Ala 


Met 


Arg 
455 


Gin 


Thr 


Ser 


Arg 


Ser 
460 


Phe 


Ala 


Ser 


Ala 


Ala 
465 


Ala 


Phe 


Gly 


Asp 


Lys 
470 


He 


Phe 


Tyr 


He 


Gly 
475 


Gly 


Leu 


His 


He 


Ala 
480 


Thr 


Asn 


Ser 


Gly 


He 
485 


Arg 


Leu 


Pro 


Ser 


Gly 
490 


Thr 


Val 


Asp 


Gly 


Ser 
495 


Ser 


Val 


Thr 


Val 


Glu 
500 


He 


Tyr 


Asp 


Val 


Asn 
505 


Lys 


Asn 


Glu 


Trp 


Lys 
510 


Met 


Ala 


Ala 


Asn 


He 
515 


Pro 


Ala 


Lys 


Arg 


Tyr 
520 


Ser 


Asp 


Pro 


Cys 


Val 
525 


Arg 


Ala 


Val 


Val 


He 
530 


Ser 


Asn 


Ser 


Leu 


Cys 
535 


Val 


Phe 


Met 


Arg 


Glu 
540 


Thr 


His 


Leu 


Asn 


Glu 
545 


Arg 


Ala 


Lys 


Tyr 


Val 
550 


Thr 


Tyr 


Gin 


Tyr 


Asp 
555 


Leu 


Glu 


Leu 


Asp 


Arg 
560 


Trp 


Ser 


Leu 


Arg 


Gin 
565 


His 


He 


Ser 


Glu 


Arg 
570 


Val 


Leu 


Trp 


Asp 


Leu 
575 


Gly 


Arg 


Asp 


Phe 


Arg 
580 


Cys 


Thr 


Val 


Gly 


Lys 
585 


Leu 


Tyr 


Pro 


Ser 


Cys 
590 


Leu 


Glu 


Glu 


Ser 


Pro 
595 


Trp 


Lys 


Pro 


Pro 


Thr 
600 


Tyr 


Leu 


Phe 


Ser 


Thr 
605 


Asp 


Gly 


Thr 


Glu 


Glu 
610 


Phe 


Glu 


Leu 


Asp 


Gly 
615 


Glu 


Met 


Val 


Ala 


Leu 
620 


Pro 


Pro 


Val 
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Table 3 hBAZF nucleotide sequence (SEQ ID NO:3) 

caagggagcg agggtgtcgt agagggcaga atgaacaaga agaattagga gggaggctgc 60 

gtgtgccggg gctaggggct ggaagtcctg gctctagttg cacctcggaa ggaaaaggca 120 

aacagaggag ggaaggcgtc ttaggactgc ctggatccag agcactttcc tcggcctcta 180 

caggcctgtg tcgctatqgg ttcccccgcc gccccggagg gagcgctggg ctacgtccgc 240 

gagttcactc gccactcctc cgacgtgctg ggcaacctca acgagctgcg cctgcgcggg 300 

atcctcactg acgtcacgct gctggttggc gggcaacccc tcagagcaca caaggcagtt 360 

ctcatcgcct gcagtggctt cttctattca attttccggg gccgtgcggg agtcggggtg 420 

gacgtgctct ctctgcccgg gggtcccgaa gcgagaggct tcgcccctct attggacttc 480 

atgtacactt cgcgcctgcg cctctctcca gccactgcac cagcagtcct agcggccgcc 540 

acctatttgc agatggagca cgtggtccag gcatgccacc gcttcatcca ggccagctat 600 

gaacctctgg gcatctccct gcgccccctg gaagcagaac ccccaacacc cccaacggcc 660 

cctccaccag gtagtcccag gcgctccgaa ggacacccag acccacctac tgaatctcga 720 

agctgcagtc aaggcccccc cagtccagcc agccctgacc ccaaggcctg caactggaaa 780 

aagtacaagt acatcgtgct aaactctcag gcctcccaag cagggagcct ggtcggggag 840 

agaagttctg gtcaaccttg cccccaagcc aggctcccca gtggagacga ggcctccagc 900 

agcagcagca gcagcagcag cagcagcagt gaagaaggac ccattcctgg tccccagagc 960 

aggctctctc caactgctgc cactgtgcag ttcaaatgtg gggctccagc cagtaccccc 1020 

tacctcctca catcccaggc tcaagacacc tctggatcac cctctgaacg ggctcgtcca 1080 

ctacccggga gtgaattttt cagctgccag aactgtgagg ctgtggcagg gtgctcatcg 1140 

gggctggact ccttggttcc tggggacgaa gacaaaccct ataagtgtca gctgtgccgg 1200 

tcttcgttcc gctacaaggg caaccttgcc agtcaccgta cagtgcacac aggggaaaag 1260 

ccttaccact gctcaatctg cggagcccgt tttaaccggc cagcaaacct gaaaacgcac 1320 

agccgcatcc attcgggaga gaagccgtat aagtgtgaga cgtgcggctc gcgctttgta 1380 

caggtacgga gccagcctcc aagtggcttc caaggcaaac ctgcaagagg tggggtgggc 1440 

caaaagggag ggttctgttc ctcccagagg caggacttga agtctcctcc ctcccaggtg 1500 

gcacatctgc gggcgcacgt gctgatccac accggggaga agccctaccc ttgccctacc 1560 

tgcggaaccc gcttccgcca cctgcagacc ctcaagagcc acgttcgcat ccacaccgga 1620 

gagaagcctt accactgcga cccctgtggc ctgcatttcc ggcacaagag tcaactgcgg 1680 

ctgcatctgc gccagaaaca cggagctgct accaacacca aagtgcacta ccacattctc 1740 

ggggggccc 1749 



Table 4 hBAZF polypeptide sequence (SEQ ID NO:4) 

Met Gly Ser Pro Ala Ala Pro Glu Gly Ala Leu Gly Tyr Val Arg Glu 

15 10 15 

Phe Thr Arg His Ser Ser Asp Val Leu Gly Asn Leu Asn Glu Leu Arg 

20 25 30 

Leu Arg Gly lie Leu Thr Asp Val Thr Leu Leu Val Gly Gly Gin Pro 
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Leu 


Arg 
50 


Ala 


His 


Lys 


Ala 


Val 
55 


Leu 


He 


Ala 


Cys 


Ser 
60 


Gly 


Phe 


Phe 


Tyr 


Ser 
65 


He 


Phe 


Arg 


Gly 


Arg 
70 


Ala 


Gly 


Val 


Gly 


Val 
75 


Asp 


Val 


Leu 


Ser 


Leu 
80 


Pro 


Gly 


Gly 


Pro 


Glu 
85 


Ala 


Arg 


Gly 


Phe 


Ala 
90 


Pro 


Leu 


Leu 


Asp 


Phe 
95 


Met 


Tyr 


Thr 


Ser 


Arg 
100 


Leu 


Arg 


Leu 


Ser 


Pro 
105 


Ala 


Thr 


Ala 


Pro 


Ala 
110 


Val 


Leu 


Ala 


Ala 


Ala 
115 


Thr 


Tyr 


Leu 


Gin 


Met 
120 


Glu 


His 


Val 


Val 


Gin 
125 


Ala 


Cys 


His 


Arg 


Phe 
130 


He 


Gin 


Ala 


Ser 


Tyr 
135 


Glu 


Pro 


Leu 


Gly 


He 
140 


Ser 


Leu 


Arg 


Pro 


Leu 
145 


Glu 


Ala 


Glu 


Pro 


Pro 
150 


Thr 


Pro 


Pro 


Thr 


Ala 
155 


Pro 


Pro 


Pro 


Gly 


Ser 
160 


Pro 


Arg 


Arg 


Ser 


Glu 
165 


Gly 


His 


Pro 


Asp 


Pro 
170 


Pro 


Thr 


Glu 


Ser 


Arg 
175 


Ser 


Cys 


Ser 


Gin 


Gly 
180 


Pro 


Pro 


Ser 


Pro 


Ala 
185 


Ser 


Pro 


Asp 


Pro 


Lys 
190 


Ala 


Cys 


Asn 


Trp 


Lys 
195 


Lys 


Tyr 


Lys 


Tyr 


He 
200 


Val 


Leu 


Asn 


Ser 


Gin 
205 


Ala 


Ser 


Gin 


Ala 


Gly 
210 


Ser 


Leu 


Val 


Gly 


Glu 
215 


Arg 


Ser 


Ser 


Gly 


Gin 
220 


Pro 


Cys 


Pro 


Gin 


Ala 
225 


Arg 


Leu 


Pro 


Ser 


Gly 
230 


Asp 


Glu 


Ala 


Ser 


Ser 
235 


Ser 


Ser 


Ser 


Ser 


Ser 
240 


Ser 


Ser 


Ser 


Ser 


Ser 
245 


Glu 


Glu 


Gly 


Pro 


He 
250 


Pro 


Gly 


Pro 


Gin 


Ser 
255 


Arg 


Leu 


Ser 


Pro 


Thr 
260 


Ala 


Ala 


Thr 


Val 


Gin 
265 


Phe 


Lys 


Cys 


Gly 


Ala 
270 


Pro 


Ala 


Ser 


Thr 


Pro 
275 


Tyr 


Leu 


Leu 


Thr 


Ser 
280 


Gin 


Ala 


Gin 


Asp 


Thr 
285 


Ser 


Gly 


Ser 


Pro 


Ser Glu 
290 


Arg 


Ala 


Arg 


Pro 
295 


Leu 


Pro 


Gly 


Ser 


Glu 
300 


Phe 


Phe 


Ser 


Cys 


Gin 
305 


Asn 


Cys 


Glu 


Ala 


Val 
310 


Ala 


Gly 


Cys 


Ser 


Ser 
315 


Gly 


Leu 


Asp 


Ser 


Leu 
320 


Val 


Pro 


Gly 


Asp 


Glu 
325 


Asp 


Lys 


Pro 


Tyr 


Lys 
330 


Cys 


Gin 


Leu 


Cys 


Arg 
335 


Ser 


Ser 


Phe 


Arg 


Tyr 
340 


Lys 


Gly 


Asn 


Leu 


Ala 
345 


Ser 


His 


Arg 


Thr 


Val 
350 


His 


Thr 


Gly 


Glu 


Lys 
355 


Pro 


Tyr 


His 


Cys 


Ser 
360 


He 


Cys 


Gly 


Ala 


Arg 
365 


Phe 


Asn 


Arg 


Pro 


Ala 
370 


Asn 


Leu 


Lys 


Thr 


His 
375 


Ser 


Arg 


He 


His 


Ser 
380 


Gly 


Glu 


Lys 


Pro 


Tyr 
385 


Lys 


Cys 


Glu 


Thr 


Cys 
390 


Gly 


Ser 


Arg 


Phe 


Val 
395 


Gin 


Val 


Arg 


Ser 


Gin 
400 
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Pro 


Pro 


Ser 


Gly 


Phe 
405 


Gin 


Gly 


Lys 


Pro 


Ala 
410 


Arg 


Gly 


Gly 


Val 


Gly 
415 


Gin 


Lys 


Gly 


Gly 


Phe 
420 


Cys 


Ser 


Ser 


Gin 


Arg 
425 


Gin 


Asp 


Leu 


Lys 


Ser 
430 


Pro 


Pro 


Ser 


Gin 


Val 
435 


Ala 


His 


Leu 


Arg 


Ala 
440 


His 


Val 


Leu 


He 


His 
445 


Thr 


Gly 


Glu 


Lys 


Pro 
450 


Tyr 


Pro 


Cys 


Pro 


Thr 
455 


Cys 


Gly 


Thr 


Arg 


Phe 
460 


Arg 


His 


Leu 


Gin 


Thr 
465 


Leu 


Lys 


Ser 


His 


Val 
470 


Arg 


He 


His 


Thr 


Gly 
475 


Glu 


Lys 


Pro 


Tyr 


His 
480 


Cys 


Asp 


Pro 


Cys 


Gly 
485 


Leu 


His 


Phe 


Arg 


His 
490 


Lys 


Ser 


Gin 


Leu 


Arg 
495 


Leu 


His 


Leu 


Arg 


Gin 
500 


Lys 


His 


Gly 


Ala 


Ala 
505 


Thr 


Asn 


Thr 


Lys 


Val 
510 


His 


Tyr 


His 


He 


Leu 
515 


Gly 


Gly 


Pro 























Table 5 hEF-G nucleotide sequence (SEQ ID NO:5) 



tctttttcct 


cgcgtccttt 


gccccggaag 


tgctcttaca 


acattggctg 


ccggcgtgac 


60 


tttgaccgct 


tcccggtgcg 


ttaccggcag 


ctgaacccac 


ccggcgccac 


gggactttga 


120 


cgcgtgctct 


gcgcttgcca 


tgaqactcct 


gggagctgca 


gccgtcgcgg 


ctctggggcg 


180 


cggaagggcc 


cccgcctccc 


taggctggca 


gaggaagcag 


gttaattgga 


aggcctgccg 


240 


atggtcttca 


tcaggggtga 


ttcctaatga 


aaaaatacga 


aatattggaa 


tctcagctca 


300 


cattgattct 


gggaaaacta 


cattaacaga 


acgagtcctt 


tactacactg 


gcagaattgc 


360 


aaagatgcat 


gaggtgaaag 


gtaaagatgg 


agttggtgct 


gtcatggatt 


ccatggaact 


420 


agagagacaa 


agaggaatca 


ctattcagtc 


agcagccact 


ttcaccatgt 


ggaaagatgt 


480 


caatattaac 


attatagata 


ctcctgggca 


tgtggacttc 


acaatagaag 


tggaaagggc 


540 


cctgagagtg 


ttggatggtg 


cagtccttgt 


tctctgtgct 


gttggagggg 


tacagtgcca 


600 


gaccatgact 


gtcaatcgtc 


agatgaagcg 


ctacaacgtt 


ccgtttctaa 


cttttattaa 


660 


caaattggac 


cgaatgggct 


ccaacccagc 


cagggccctg 


cagcaaatga 


ggtctaaact 


720 


aaatcataat 


acagcgttta 


tgcagatacc 


catgggtttg 


gagggtaatt 


ttaaaggtat 


780 


tgtagatctt 


attgaggaac 


gagccatcta 


ttttgatgga 


gactttagtc 


agattgttcg 


840 


atatggtgag 


attccagctg 


aattaagggc 


ggcggccact 


gaccaccggc 


aggagctaat 


900 


tgaatgtgtt 


gccaattcag 


atgaacagct 


tggtgagatg 


tttctggaag 


aaaaaatccc 


960 


ctcgat ttct 


gatttaaagc 


tagcaattcg 


aagagctact 


ctgaaaagat 


catttactcc 


1020 


tgtatttttg 


ggaagcgcct 


tgaagaacaa 


aggagttcag 


cctcttttag 


atgctgtttt 


1080 


agaatacctc 


ccaaatccat 


ctgaagtcca 


gaactatgct 


attctcaata 


aaaaggatga 


1140 


ctcaaaagag 


aaaaccaaaa 


tcctaatgaa 


ctccagtaga 


cacaattccc 


acccatttgt 


1200 


aggcctggct 


tttcccctgg 


aggtaggtcg 


atttggacaa 


ttaacttatg 


ttcgcagtta 


1260 


tcagggagag 


ctaaagaagg 


gtgacaccat 


ctataacaca 


aggacaagaa 


agaaagtacg 


1320 


gttgcaacgg 


ctggctcgca 


tgcatgccga 


catgatggag 


gcaagtacag 


aggaagtata 


1380 


tgccggagac 


atctgtgcat 


tgtttggcat 


tgactgtgct 


agtggagaca 


cattcacaga 


1440 
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caaagccaac 

agcaatgaag 

tacaagagaa 

atctggaatg 

ctgtccttgt 

cccgtttgac 

aggtgtcctg 

attcggatca 

ctgcgagaag 

tggagcacac 

tcttaaacaa 

agttgtagct 

ggtaatcact 

aaatgatatg 

cacaatggag 

taagtatttg 

ttgcttactg 

ttccagtgga 

ttctttcttc 

taattctatg 

taaaatataa 

ttatactttt 

acatagagaa 

tgttaacggg 

tatctgttta 

tatatcatga 

tccgtatgtt 

ccaaatttcn 

gtaactgcct 

aagtagaccc 

aatcaaatgt 

ttccttggaa 

tttgtttcca 

tcatccaaaa 

gaaataaaat 



agcggccttt 

ccttctaaca 

gatcccacat 

ggagaattac 

atcacaggaa 

tttacacata 

gagcctctgg 

aatattccaa 

ggccctcttt 

cacatggttg 

gccttggcaa 

ccaaatgaat 

gggcaagatg 

tttggttatt 

tatagcaggt 

gaagctacag 

tgagttgact 

ataaattcag 

aaaagaagcc 

tctatctcaa 

ttattactga 

aagcttagaa 

aagctgaaaa 

ttgtttatca 

agtctcataa 

cacttcttgg 

cttcatagag 

ctctgtttta 

ctgttttagg 

ttgcatatac 

aatttataga 

tttatcgtat 

gaaaaagttg 

atcatctgat 

aattgatttt 



ctatggagtc 

agaacgatct 

ttaaagtata 

acctggaaat 

agccaaaagt 

aaaaacaatc 

acccagagga 

agcagtttgt 

ctggtcacaa 

attctaatga 

atgcaacatt 

ttcagggaca 

gagttgagga 

ccactgaact 

atcagccatg 

gtcaacttcc 

gactctaatt 

gctgctgaaa 

cttcttgttc 

ctctattgat 

aatatgttta 

tgtatgttca 

aatgcaaaga 

tataataatt 

ctctattttt 

agtgtcatta 

agtgtttttc 

tatatgncag 

agtataagta 

tattcttgtt 

gttagtttca 

attgtattca 

ggctttccca 

ggtatagatg 

a 



aattcatgtt 

ggaaaaattt 

ctttgacact 

ctatgctcag 

tgcctttcga 

aggtggtgca 

ctacactaaa 

gcctgctgta 

gctctctggg 

aatctctttc 

atgtattctt 

agtaattgca 

ctattttaca 

taggtcatgc 

tttaccatcc 

tgttaaaaaa 

gaatctgcgt 

caagaaattc 

atattcagga 

tggttttata 

atatttaagg 

tttccaaatt 

ataaccacat 

tgttttgtca 

agtttgctga 

atgggcaggc 

agattcttca 

tttatttcag 

ttacttcctt 

tgtgttcatc 

tcaacctaat 

ctgagattat 

agcagttcta 

gatcctagtc 



cctgatcctg 

tcaaaaggta 

gagaacaaag 

aggctggaaa 

gagaccatta 

ggccagtatg 

ttggaatttt 

gaaaaggggt 

ctccggtttg 

atccgagcag 

gaacctatta 

ggaat taacc 

ctgtatgcag 

acagagggaa 

acacaagaag 

ggaaaagcca 

ggttttgata 

tgagcccagg 

gcttctgtta 

gttcattgaa 

ggaaaagaga 

ttgtatcata 

actttccatc 

tatttgcttt 

agacttgaaa 

ttttctgttg 

ttgggatatt 

tttgtggttt 

gtggtctatt 

ttaatgtttt 

gaatgctagt 

gaagggacaa 

ttacccggtt 

cttttcatta 



tcatttcaat 

ttggcaggtt 

agacagttat 

gagagtatgg 

ctgcccctgt 

gaaaagtaat 

cagatgaaac 

ttttagatgc 

tcctgcaaga 

gagaaggtgc 

tggctgtgga 

gacgccatgg 

atgtccctct 

agggagaata 

acgtcattaa 

agaactaact 

ctttgatgga 

aagcgggctc 

tattcaaagg 

aatcctcaaa 

ctaatttcag 

agagttttca 

taccttcctt 

cactgtctat 

gtgaatcgca 

aagagtggat 

aaaatattag 

ctgcaaattt 

gtgaagtaaa 

tgtacagcta 

taaatttgaa 

atgttaatct 

cagaattgct 

cctgatggta 



Table 6 



hEF-G polypeptide (SEQ ID NO:6) 



1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

"3480 

3501 



Met 
1 


Arg 


Leu 


Leu 


Gly 
5 


Ala 


Ala 


Ala 


Val 


Ala 
10 


Ala 


Leu 


Gly 


Arg 


Gly 
15 


Arg 


Ala 


Pro 


Ala 


Ser 
20 


Leu 


Gly 


Trp 


Gin 


Arg 
25 


Lys 


Gin 


Val 


Asn 


Trp 
30 


Lys 


Ala 


Cys 


Arg 


Trp 
35 


Ser 


Ser 


Ser 


Gly 


Val 
40 


He 


Pro 


Asn 


Glu 


Lys 
45 


He 


Arg 


Asn 
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He 


Gly 
50 


He 


Ser 


Ala 


His 


He 
55 


Asp 


Ser 


Gly 


Lys 


Thr 
60 


Thr 


Leu 


Thr 


Glu 


Arg 
65 


Val 


Leu 


Tyr 


Tyr 


Thr 
70 


Gly 


Arg 


He 


Ala 


Lys 
75 


Met 


His 


Glu 


Val 


Lys 
80 


Gly 


Lys 


Asp 


Gly 


Val 
85 


Gly 


Ala 


Val 


Met 


Asp 
90 


Ser 


Met 


Glu 


Leu 


Glu 
95 


Arg 


Gin 


Arg 


Gly 


He 
100 


Thr 


He 


Gin 


Ser 


Ala 
105 


Ala 


Thr 


Phe 


Thr 


Met 
110 


Trp 


Lys 


Asp 


Val 


Asn 
115 


He 


Asn 


He 


He 


Asp 
120 


Thr 


Pro 


Gly 


His 


Val 
125 


Asp 


Phe 


Thr 


He 


Glu 
130 


Val 


Glu 


Arg 


Ala 


Leu 
135 


Arg 


Val 


Leu 


Asp 


Gly 
140 


Ala 


Val 


Leu 


Val 


Leu 
145 


Cys 


Ala 


Val 


Gly 


Gly 
150 


Val 


Gin 


Cys 


Gin 


Thr 
155 


Met 


Thr 


Val 


Asn 


Arg 
160 


Gin 


Met 


Lys 


Arg 


Tyr 
165 


Asn 


Val 


Pro 


Phe 


Leu 
170 


Thr 


Phe 


He 


Asn 


Lys 
175 


Leu 


Asp 


Arg 


Met 


Gly 
180 


Ser 


Asn 


Pro 


Ala 


Arg 
185 


Ala 


Leu 


Gin 


Gin 


Met 
190 


Arg 


Ser 


Lys 


Leu 


Asn 
195 


His 


Asn 


Thr 


Ala 


Phe 
200 


Met 


Gin 


He 


Pro 


Met 
205 


Gly 


Leu 


Glu 


Gly 


Asn 
210 


Phe 


Lys 


Gly 


He 


Val 
215 


Asp 


Leu 


He 


Glu 


Glu 
220 


Arg 


Ala 


He 


Tyr 


Phe 
225 


Asp 


Gly 


Asp 


Phe 


Ser 
230 


Gin 


He 


Val 


Arg 


Tyr 
235 


Gly 


Glu 


He 


Pro 


Ala 
240 


Glu 


Leu 


Arg 


Ala 


Ala 
245 


Ala 


Thr 


Asp 


His 


Arg 
250 


Gin 


Glu 


Leu 


He 


Glu 
255 


Cys 


Val 


Ala 


Asn 


Ser 
260 


Asp 


Glu 


Gin 


Leu 


Gly 
265 


Glu 


Met 


Phe 


Leu 


Glu 
270 


Glu 


Lys 


He 


Pro 


Ser 
275 


He 


Ser 


Asp 


Leu 


Lys 
280 


Leu 


Ala 


He 


Arg 


Arg 
285 


Ala 


Thr 


Leu 


Lys 


Arg 
290 


Ser 


Phe 


Thr 


Pro 


Val 
295 


Phe 


Leu 


Gly 


Ser 


Ala 
300 


Leu 


Lys 


Asn 


Lys 


Gly 
305 


Val 


Gin 


Pro 


Leu 


Leu 
310 


Asp 


Ala 


Val 


Leu 


Glu 
315 


Tyr 


Leu 


Pro 


Asn 


Pro 
320 


Ser 


Glu 


Val 


Gin 


Asn 
325 


Tyr 


Ala 


He 


Leu 


Asn 
330 


Lys 


Lys 


Asp 


Asp 


Ser 
335 


Lys 


Glu 


Lys 


Thr 


Lys 
340 


He 


Leu 


Met 


Asn 


Ser 
345 


Ser 


Arg 


His 


Asn 


Ser 
350 


His 


Pro 


Phe 


Val 


Gly 
355 


Leu 


Ala 


Phe 


Pro 


Leu 
360 


Glu 


Val 


Gly 


Arg 


Phe 
365 


Gly 


Gin 


Leu 


Thr 


Tyr 
370 


Val 


Arg 


Ser 


Tyr 


Gin 
375 


Gly 


Glu 


Leu 


Lys 


Lys 
380 


Gly 


Asp 


Thr 


He 


Tyr 
385 


Asn 


Thr 


Arg 


Thr 


Arg 
390 


Lys 


Lys 


Val 


Arg 


Leu 
395 


Gin 


Arg 


Leu 


Ala 


Arg 
400 
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Met 


His 


Ala 


Asp 


Met 
405 


Met 


Glu 


Ala 


Ser 


Thr 
410 


Glu 


Glu 


Val 


Tyr 


Ala 
415 


Gly 


Asp 


He 


Cys 


Ala 
420 


Leu 


Phe 


Gly 


He 


Asp 
425 


Cys 


Ala 


Ser 


Gly 


Asp 
430 


Thr 


Phe 


Thr 


Asp 


Lys 
435 


Ala 


Asn 


Ser 


Gly 


Leu 
440 


Ser 


Met 


Glu 


Ser 


He 
445 


His 


Val 


Pro 


Asp 


Pro 
450 


Val 


He 


Ser 


He 


Ala 
455 


Met 


Lys 


Pro 


Ser 


Asn 
460 


Lys 


Asn 


Asp 


Leu 


Glu 
465 


Lys 


Phe 


Ser 


Lys 


Gly 
470 


He 


Gly 


Arg 


Phe 


Thr 
475 


Arg 


Glu 


Asp 


Pro 


Thr 
480 


Phe 


Lys 


Val 


Tyr 


Phe 
485 


Asp 


Thr 


Glu 


Asn 


Lys 
490 


Glu 


Thr 


Val 


He 


Ser 
495 


Gly 


Met 


Gly 


Glu 


Leu 
500 


His 


Leu 


Glu 


He 


Tyr 
505 


Ala 


Gin 


Arg 


Leu 


Glu 
510 


Arg 


Glu 


TV2T 


Gly 


Cys 
515 


Pro 


Cys 


He 


Thr 


Gly 
520 


Lys 


Pro 


Lys 


Val 


Ala 
525 


Phe 


Arg 


Glu 


Thr 


He 
530 


Thr 


Ala 


Pro 


Val 


Pro 
535 


Phe 


Asp 


Phe 


Thr 


His 
540 


Lys 


Lys 


Gin 


Ser 


Gly 
545 


Gly 


Ala 


Gly 


Gin 


Tyr 
550 


Gly 


Lys 


Val 


He 


Gly 
555 


Val 


Leu 


Glu 


Pro 


Leu 
560 


Asp 


Pro 


Glu 


Asp 


Tyr 
565 


Thr 


Lys 


Leu 


Glu 


Phe 
570 


Ser 


Asp 


Glu 


Thr 


Phe 
575 


Gly 


Ser 


Asn 


He 


Pro 
580 


Lys 


Gin 


Phe 


Val 


Pro 
585 


Ala 


Val 


Glu 


Lys 


Gly 
590 


Phe 


Leu 


Asp 


Ala 


Cys 
595 


Glu 


Lys 


Gly 


Pro 


Leu 
600 


Ser 


Gly 


His 


Lys 


Leu 
605 


Ser 


Gly 


Leu 


Arg 


Phe 
610 


Val 


Leu 


Gin 


Asp 


Gly 
615 


Ala 


His 


His 


Met 


Val 
620 


Asp 


Ser 


Asn 


Glu 


He 
625 


Ser 


Phe 


He 


Arg 


Ala 
630 


Gly 


Glu 


Gly 


Ala 


Leu 
635 


Lys 


Gin 


Ala 


Leu 


Ala 
640 


Asn 


Ala 


Thr 


Leu 


Cys 
645 


He 


Leu 


Glu 


Pro 


He 
650 


Met 


Ala 


Val 


Glu 


Val 
655 


Val 


Ala 


Pro 


Asn 


Glu 
660 


Phe 


Gin 


Gly 


Gin 


Val 
665 


He 


Ala 


Gly 


He 


Asn 
670 


Arg 


Arg 


His 


Gly 


Val 
675 


He 


Thr 


Gly 


Gin 


Asp 
680 


Gly 


Val 


Glu 


Asp 


Tyr 
685 


Phe 


Thr 


Leu 




Ala 
690 


Asp 


Val 


Pro 


Leu 


Asn 
695 


Asp 


Met 


Phe 


Gly 


Tyr 
700 


Ser 


Thr 


Glu 


Leu 


Arg 
705 


Ser 


Cys 


Thr 


Glu 


Gly 
710 


Lys 


Gly 


Glu 


Tyr 


Thr 
715 


Met 


Glu 


Tyr 


Ser 


Arg 
720 


Tyr 


Gin 


Pro 


Cys 


Leu 
725 


Pro 


Ser 


Thr 


Gin 


Glu 
730 


Asp 


Val 


He 


Asn 


Lys 
735 


Tyr 


Leu 


Glu 


Ala 


Thr 
740 


Gly 


Gin 


Leu 


Pro 


Val 
745 


Lys 


Lys 


Gly 


Lys 


Ala 
750 


Lys 


Asn 
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Table 7 hTRG nucleotide sequence (SEQ ID NO:7) 

gccgcgggag caggcggagg cggaggcggc gggggcagga ggatatcgca gccgccgctg SO 

ctccccgcct cggcggagac tcggaagttc acccgggcgc tgagtaagcc gggcacggcg 12 0 

gccgagctgc ggcagagcgt gtctgaggtg gtgcgcggct ccgtgctcct ggcaaagcca 180 

aagctaattg agccactcga ctatgaaaat gtcatcgtcc agaagaagac tcagatcctg 240 

aacgactgtt tacgggagat gctgctcttc ccttacgatg actttcagac ggccatcctg 300 

agacgacagg gtcgatacat atgctcaaca gtgcctgcga aggcggaaga ggaagcacag 360 

agcttgtttg ttacagagtg catcaaaacc tataactctg actggcatct tgtgaactat 420 

aaatatgaag attactcagg agagtttcga cagcttccga acaaagtggt caagttggat 480 

aaacttccag ttcatgtcta tgaagttgac gaggaggtcg acaaagatga ggatgctgcc 540 

tcccttggtt cccagaaagg tgggatcacc aagcatggct ggctgtacaa aggcaacatg 6 00 

aacagtgcca tcagcgtgac catgaggtca tttaagagac gatttttcca cctgattcaa 660 

cttggcgatg gatcctataa atttgaattt ttaaaagatc tccaaaagga accaaaagga 720 

tcaatatttc tgggattcct gtatggggtg tcgttcagga acaacaaagt caggcgtttt 780 

gcttttgagc tcaagatgca ggacaaaagt agttatctct tggcagcaga cagtgaagtg 840 

gaaatggaag aatggatcac aattctaaat aagatcctcc agctcaactt tgaagctgca 900 

atgcaagaaa agcgaaatgg cgactctcac gaagatgatg aacaaagcaa attggaaggt 960 

tctggttccg gtttagatag ctacctgccg gaacttgcca agagtgcaag agaagcagaa 1020 

atcaaactga aaagtgaaag cagagtcaaa cttttttatt tggacccaga tgcccagaag 1080 

cttgacttct catcagctga gccagaagtg aagtcatttg aagagaagtt tggaaaaagg 1140 

atccttgtca agtgcaatga tttatctttc aatttgcaat gctgtgttgc cgaaaatgaa 1200 

gaaggaccca ctacaaatgt tgaacctttc tttgttactc tatccctgtt tgacataaaa 1260 

tacaaccgga agatttctgc cgatttccac gtagacctga accatttctc agtgaggcaa 1320 

atgatcgcca ccacgtcccc ggcgctgatg aatggcagtg ggccggaaac ccaatctgcc 1380 

ctcaggggca tccttcatga agccgccatg cagtatccga agcagggaat attttcagtc 1440 

acttgtcctc atccagatat atttcttgtg gccagaattg aaaaagtcct tcaggggagc 1500 

atcacacatt gcgctgagcc atatatgaaa agttcagact cttctaaggt ggcccagaag 1560 

gtgctgaaga atgccaagca ggcatgccaa agactaggac agtatagaat gccatttgct 1620 

tgggcagcaa ggacattgtt taaggatgca tctggaaatc ttgacaaaaa tgccagattt 1680 

tctgccatct acaggcaaga cagcaataag ctatccaatg atgacatgct caagttactt 1740 

gcagactttc ggaaacctga. gaagatggct aagctcccag tgattttagg caatctagac 1800 

attacaattg ataatgtttc ctcagacttc cctaattatg ttaattcatc atacattccc 1860 

acaaaacaat ttgaaacctg cagtaaaact cccatcacgt ttgaagtgga ggaatttgtg 1920 

ccctgcatac caaaacacac tcagccttac accatctaca ccaatcacct ttacgtttat 1980 

cctaagtact tgaaatacga cagtcagaag tcttttgcca aggctagaaa tattgcgatt 2040 

tgcattgaat tcaaagattc agatgaggaa gactctcagc cccttaagtg catttatggc 2100 

agacctggtg ggccagtttt cacaagaagc gcctttgctg cagttttaca ccatcaccaa 2160 

aacccagaat tttatgatga gattaaaata gagttgccca ctcagctgca tgaaaagcac 2220 

cacctgttgc tcacattctt ccatgtcagc tgtgacaact caagtaaagg aagcacgaag 2280 

aagagggatg tcgttgaaac ccaagttggc tactcctggc ttcccctcct gaaagacgga 2340 

agggtggtga caagcgagca gcacatcccg gtctcggcga accttccttc gggctatctt 2400 
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ggctaccagg agcttgggat gggcaggcat t atggtccgg aaattaaatg ggtagatgga 
ggcaagccac tgctgaaaat ttccactcat ctggtttcta cagtgtatac -aggatcag 
catttacata attttttcca gtactgtcag aaaaccgaat ctggagccca agccttagga 
aacgaacttg taaagtacct taagagtctg catgcgatgg aaggccacgt gatgatcgcc 
ttcttgccca ctatcctaaa ccagctgttc cgagtcctca ccagagccac acaggaagaa 
gtcgcggtta acgtgactcg ggtcattatt catgtggttg cccagtgcca ^aggaagga 
Lggagagcc acttgaggtc atatgttaag tacgcgtata aggctgagcc atatgttgcc 
tctgaataca agacagtgca tgaagaactg accaaatcca tgaccacgat tctcaagcct 
tctgccgatt tcctcaccag caacaaacta ctgaagtact catggttttt ctttgatgta 
ctgatcaaat ctatggctca gcatttgata gagaactcca aagttaagtt gctgcgaaac 
cagagatttc ctgcatccta tcatcatgca gtggaaaccg ttgtaaatat gctgatgcca 
cacatcactc agaagtttcg agataatcca gaggcatcta agaacgcgaa tcatagcctt 
gctgtcttca tcaagagatg tttcaccttc atggacaggg gctttgtctt caagcagatc 
aacaactaca ttagctgttt tgctcctgga gacccaaaga ccctctttga atacaagttt 
gaatttctcc gtgtagtgtg caaccatgaa cattatattc cgttgaactt accaatgcca 
tttggaaaag gcaggattca aagataccaa gacctccagc ttgactactc attaacagat 
gagttctgca gaaaccactt cttggtggga ctgttactga gggaggtggg gacagccctc 
caggagttcc gggaggtccg tctgatcgcc atcagtgtgc tcaagaacct gctgataaag 
cattcttttg atgacagata tgcttcaagg agccatcagg caaggatagc caccc c 
ctgcctctgt ttggtctgct gattgaaaac gtccagcgga tcaatgtgag ggatgtgtca 
cccttccctg tgaacgcggg catgactgtg aaggatgaat ccctggctct accagctgtg 
aatccgctgg tgacgccgca gaagggaagc accctggaca acagcctgca caaggacctg 
ctgggcgcca tctccggcat tgcttctcca tatacaacct caactccaaa catcaacagt 
gtgagaaatg ctgattcgag aggatctctc ataagcacag attcgggtaa cagcc cca 
gaaaggaata gtgagaagag caattccctg gataagcacc aacaaagtag c«at ^ 
aattccgtgg ttcgctgtga taaacttgac cagtctgaga ttaagagcct actgatgtgt 
ttcctctaca tcttaaagag catgtctgat gatgctttgt ttacatattg gaacaaggct 
tcaacatctg aacttatgga tttttttaca atatctgaag tctgcctgca ccagttccag 
tacatgggga agcgatacat agccagaaca ggaatgatgc atgccagatt gcagcagctg 
ggcagcctgg ataactctct cacttttaac cacagctatg gccactcgga cgcagatgtt 
ctgcaccagt cattacttga agccaacatt gctactgagg tttgcctgac agctctggac 
acgctttctc tatttacatt ggcgtttaag aaccagctcc tggccgacca tggacataat 
cctctcatga aaaaagtttt tgatgtctac ctgtgttttc ttcaaaaaca tcagtctgaa 
acggctttaa aaaatgtctt cactgcctta aggtccttaa tttataagtt tccctcaaca 
ttctatgaag ggagagcgga catgtgtgcg gctctgtgtt acgagattct caagtgctgt 
aactccaagc tgagctccat caggacggag gcctcccagc tgctctactt cctgatgagg 
aacaactttg attacactgg aaagaagtcc tttgtccgga cacatttgca agtcatcata 
tctgtcagcc agctgatagc agacgttgtt ggcattgggg gaaccagatt ccagcagtcc 
ctgtccatca tcaacaactg tgccaacagt gaccggctta ttaagcacac cagcttctcc 
tctgatgtga aggacttaac caaaaggata cgcacggtgc taatggccac cgcccagatg 
aaggagcatg agaacgaccc agagatgctg gtggacctcc agtacagcct ggccaaatcc 
tatgccagca cgcccgagct caggaagacg tggctcgaca gcatggccag gatccatgtc 
aaaaatggcg atctctcaga ggcagcaatg tgctatgtcc acgtaacagc cctagtggca 
gaatatctca cacggaaaga agcagtccag tgggagccgc cccttctccc ccacagccat 



2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
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agcgcctgcc tgaggaggag ccggggaggc gtgtttagac aaggatgcac cgccttcagg 5100 

gtcattaccc caaacatcga cgaggaggcc tccatgatgg aagacgtggg gatgcaggat 5160 

gtccatttca acgaggatgt gctgatggag ctccttgagc agtgcgcaga tggactctgg 5220 

aaagccgagc gctacgagct cattgccgac atctacaaac ttatcatccc catttatgag 5280 

aagcggaggg attttgagag gctggcccat • ctgtatgaca cgctgcaccg ggcctacagc 5340 

aaagtgaccg aggtcatgca ctcgggccgc aggcttctgg ggacctactt ccgggtagcc 5400 

ttcttcgggc aggcagcgca ataccagttt acagacagtg aaacagatgt ggagggattc 5460 

tttgaagatg aagatggaaa ggagtatatt tacaaggaac ccaaactcac accgctgtcg 5520 

gaaatttctc agagactcct taaactgtac tcggataaat ttggttctga aaatgtcaaa 5580 

atgatacagg attctggcaa ggtcaaccct aaggatctgg attctaagta tgcctacatc 5640 

caggtgactc acgtcatccc cttctttgac gaaaaagagt tgcaagaaag gaaaacagag 5700 

tttgagagat cccacaacat ccgccgcttc atgtttgaga tgccatttac gcagaccggg 5760 

aagaggcagg gcggggtgga agagcagtgc aaacggcgca ccatcctgac agccatacac 582 0 

tgcttccctt atgtgaagaa gcgcatccct gtcatgtacc agcaccacac tgacctgaac 5880 

cccatcgagg. tggccattga cgagatgagt aagaaggtgg cggagctccg gcagctgtgc 5940 

tcctcggccg aggtggacat gatcaaactg cagctcaaac tccagggcag cgtgagtgtt 6000 

caggtcaatg ctggcccact agcatatgcg cgagctttct tagatgatac aaacacaaag 6060 

cgatatcctg acaataaagt gaagctgctt aaggaagttt tcaggcaatt tgtggaagct 6120 

tgcggtcaag ccttagcggt aaacgaacgt ctgattaaag aagaccagct cgagtatcag 6180 

gaagaaatga aagccaacta cagggaaatg gcgaaggagc tttctgaaat catgcatgag 6240 

cagatctgcc ccctggagga gaagacgagc gtcttaccga attcccttca catcttcaac 6300 

gccatcagtg ggactccaac aagcacaatg gttcacggga tgaccagctc gtcttcggtc 6360 

ata toa ttac atctcatggc ccgtgtgtgg ggacttgctt tgtcatttgc aaactcagga 6420 

tgctttccaa agccaatcac tggggagacc gagcacaggg aggaccaagg ggaaggggag 648 0 

agaaaggaaa taaagaacaa cgttatttct taacagactt tctataggag ttgtaagaag 6540 

gtgcacatat ttttttaaat ctcactggca atattcaaag ttttcattgt gtcttaacaa 6600 

aggtgtggta gacactcttg agctggactt agattttatt cttccttgca gagtagtgtt 6660 

agaatagatg gcctacagaa aaaaaaggtt ctgggatcta catggcaggg agggctgcac 672 0 

tgacattgat gcctggggga ccttttgcct cgaggctgag ctggaaaatc ttgaaaatat 6780 

tttttttttc ctgtggcaca ttcaggttga atacaagaac tatttttgtg actagttttt 6840 

gatgacctaa gggaactgac cattgtaatt tttgtaccag tgaaccagga gatttagtgc 6900 

ttttatattc atttccttgc atttaagaaa atatgaaagc ttaaggaatt atgtgagctt 6960 

aaaactagtc aagcagttta gaaccaaagg cctatattaa taaccgcaac tatgctgaaa 7020 

agtacaaagt agtacagtat attgttatgt acatatcatt gttaatacag tcctggcatt 7080 

ctgtacatat atgtattaca tttctacatt tttaatactc acatgggctt atgcattaag 7140 

tttaattgtg ataaatttgt gctgttccag tatatgcaat acactttaat gttttattct 7200 

tgtacataaa aatgtgcaat atggagatgt atacagtctt tactatatta ggtttataaa 7260 

cagttttaag aatttcatcc ttttgccaaa atggtggagt atgtaattgg taaatcataa 7320 

atcctgtggt gaatggtggt gtactttaaa gctgtcacca tgttatattt tcttttaaga 7380 

ctttaattta gtaattttat atttgggaaa ataaaggttt ttaattttat ttaactggaa 7440 

tcactgccct gctgtaatta aacattctgt accacatctg tattaaaaag acattgctga 7500 

ccatta 7506 
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Table 8 hTRG polypeptide sequence (SEQ ID NO: 8) 



Met Ser Gin Pro Pro Le u Leu Pro Ala Ser Ala Glu Thr Arg Lys Phe 
1 5 10 15 

Thr Arg Ala Leu Ser Lys Pro Gly Thr Ala Ala Glu Leu Arg Gin Ser 
20 25 JU 

Val Ser Glu Val Val Arg Gly Ser Val Leu Leu Ala Lys Pro Lys Leu 
35 40 45 

He Glu Pro Leu Asp Tyr Glu Asn Val He Val Gin Lys Lys Thr Gin 
50 55 60 

lie Leu Asn Asp Cys Leu Arg Glu Met Leu Leu Phe Pro Tyr Asp Asp 
65 70 75 80 

Phe Gin Thr Ala He Leu Arg Arg Gin Gly Arg Tyr He Cys Ser Thr 
85 9° 95 

Val Pro Ala Lys Ala Glu Glu Glu Ala Gin Ser Leu Phe Val Thr Glu 
100 105 HO 

Cvs He Lys Thr Tyr Asn Ser Asp Trp His Leu Val Asn Tyr Lys Tyr 
H5 120 125 

Glu Asp Tyr Ser Gly Glu Phe Arg Gin Leu Pro Asn Lys Val Val Lys 
130 135 140 

Leu Asp Lys Leu Pro Val His Val Tyr Glu Val Asp Glu Glu Val Asp 

145 150 155 

Lys Asp Glu Asp Ala Ala Ser Leu Gly Ser Gin Lys Gly Gly He Thr 

165 170 175 

Lys His Gly Trp Leu Tyr Lys Gly Asn Met Asn Ser Ala lie Ser Val 
180 185 1^0 

Thr Met Arg Ser Phe Lys Arg Arg Phe Phe His Leu He Gin Leu Gly 
195 200 205 

Asp Gly Ser Tyr Lys Phe Glu Phe Leu Lys Asp Leu Gin Lys Glu Pro 
210 215 220 

Lys Gly Ser He Phe Leu Gly Phe Leu Tyr Gly Val Ser Phe Arg Asn 
225 230 235 ^40 

Asn Lys Val Arg Arg Phe Ala Phe Glu Leu Lys Met Gin Asp Lys Ser 
245 250 255 

Ser Tyr Leu Leu Ala Ala Asp Ser Glu Val Glu Met Glu Glu Trp He 
260 265 270 

Thr He Leu Asn Lys He Leu Gin Leu Asn Phe Glu Ala Ala Met Gin 
275 280 285 

Glu Lys Arg Asn Gly Asp Ser His Glu Asp Asp Glu Gin Ser Lys Leu 
290 295 300 

Glu Gly Ser Gly Ser Gly Leu Asp Ser Tyr Leu Pro Glu Leu Ala Lys 
305 310 315 320 

Ser Ala Arg Glu Ala Glu He Lys Leu Lys Ser Glu Ser Arg Val Lys 
325 330 ->->3 

Leu Phe Tvr Leu Asp Pro Asp Ala Gin Lv s Leu Asp Phe Ser Ser Ala 
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340 










345 










350 






Glu 


Pro 


Glu 
355 


Val 


Lys 


Ser 


Phe 


Glu 
360 


Glu 


Lys 


Phe 


Gly 


Lys 
365 


Arg 


He 


Leu 


Val 


Lys 
370 


Cys 


Asn 


Asp 


Leu 


Ser 
375 


Phe 


Asn 


Leu 


Gin 


Cys 
380 


Cys 


Val 


Ala 


Glu 


Asn 
385 


Glu 


Glu 


Gly 


Pro 


Thr 
390 


Thr 


Asn 


Val 


Glu 


Pro 
395 


Phe 


Phe 


Val 


Thr 


Leu 
400 


Ser 


Leu 


Phe 


Asp 


He 
405 


Lys 


Tyr 


Asn 


Arg 


Lys 
410 


He 


Ser 


Ala 


Asp 


Phe 
415 


His 


Val 


Asp 


Leu 


Asn 
420 


His 


Phe 


Ser 


Val 


Arg 
425 


Gin 


Met 


He 


Ala 


Thr 
430 


Thr 


Ser 


Piro 


Ala 


Leu 
435 


Met 


Asn 


Gly 


Ser 


Gly 
440 


Pro 


Glu 


Thr 


Gin 


Ser 
445 


Ala 


Leu 


Arg 


Glv 


lie 
450 


Leu 


His 


Glu 


Ala 


Ala 
455 


Met 


Gin 


Tyr 


Pro 


Lys 
460 


Gin 


Gly 


He 


Phe 


Ser 
465 


Val 


Thr 


Cys 


Pro 


His 
470 


Pro 


Asp 


He 


Phe 


Leu 
475 


Val 


Ala 


Arg 


He 


Glu 
480 


Lys 


Val 


Leu 


Gin 


Gly 
485 


Ser 


He 


Thr 


His 


Cys 
490 


Ala 


Glu 


Pro 


Tyr 


Met 
495 


Lys 


Ser 


Ser 


Asp 


Ser 
500 


Ser 


Lys 


Val 


Ala 


Gin 
505 


Lys 


Val 


Leu 


Lys 


Asn 
510 


Ala 


Lys 


Gin 


Ala 


Cys 
515 


Gin 


Arg 


Leu 


Gly 


Gin 
520 


Tyr 


Arg 


Met 


Pro 


Phe 
525 


Ala 


Trp 


Ala 


Ala 


Arg 
530 


Thr 


Leu 


Phe 


Lys 


Asp 
535 


Ala 


Ser 


Gly 


Asn 


Leu 
540 


Asp 


Lys 


Asn 


Ala 


Arg 
545 


Phe 


Ser 


Ala 


He 


Tyr 
550 


Arg 


Gin 


Asp 


Ser 


Asn 
555 


Lys 


Leu 


Ser 


Asn 


Asp 
560 


Asp 


Met 


Leu 


Lys 


Leu 
565 


Leu 


Ala 


Asp 


Phe 


Arg 
570 


Lys 


Pro 


Glu 


Lys 


Met 
575 


Ala 


Lys 


Leu 


Pro 


Val 
580 


He 


Leu 


Gly 


Asn 


Leu 
585 


Asp 


He 


Thr 


He 


Asp 
590 


Asn 


Val 


Ser 


Ser 


Asp 
595 


Phe 


Pro 


Asn 


Tyr 


Val 
600 


Asn 


Ser 


Ser 


Tyr 


He 
605 


Pro 


Thr 


Lys 


Gin 


Phe 
610 


Glu 


Thr 


Cys 


Ser 


Lys 
615 


Thr 


Pro 


He 


Thr 


Phe 
620 


Glu 


Val 


Glu 


Glu 


Phe 
625 


Val 


Pro 


Cys 


He 


Pro 
630 


Lys 


His 


Thr 


Gin 


Pro 
635 


Tyr 


Thr 


He 


Tyr 


Thr 
640 


Asn 


His 


Leu 


Tyr 


Val 
645 


Tyr 


Pro 


Lys 


Tyr 


Leu 
650 


Lys 


Tyr 


Asp 


Ser 


Gin 
655 


Lys 


Ser 


Phe 


Ala 


Lys 
660 


Ala 


Arg 


Asn 


He 


Ala 
665 


He 


Cys 


He 


Glu 


Phe 
670 


Lys 


Asp 


Ser 


Asp 


Glu 
675 


Glu 


Asp 


Ser 


Gin 


Pro 
680 


Leu 


Lys 


Cys 


He 


Tyr 
685 


Gly Arg 


Pro 


Gly Gly 


Pro 


Val 


Phe 


Thr 


Arg 


Ser 


Ala 


Phe 


Ala 


Ala 


Val 


Leu 


His 


His 
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690 










695 










700 










His 
705 


Gin 


Asn 


Pro 


Glu 


Phe 
710 


Tyr 


Asp 


Glu 


He 


Lys 
715 


He 


Glu 


Leu 


Pro 


Thr 
720 


Gin 


Leu 


His 


Glu 


Lys 
725 


His 


His 


Leu 


Leu 


Leu 
730 


Thr 


Phe 


Phe 


His 


Val 
735 


Ser 


Cys 


Asp 


Asn 


Ser 
740 


Ser 


Lys 


Gly 


Ser 


Thr 
745 


Lys 


Lys 


Arg 


Asp 


Val 
750 


Val 


Glu 


Thr 


Gin 


Val 
755 


Gly 


Tyr 


Ser 


Trp 


Leu 
760 


Pro 


Leu 


Leu 


Lys 


Asp 
765 


Gly 


Arg 


Val 


Val 


Thr 
770 


Ser 


Glu 


Gin 


His 


He 
775 


Pro 


Val 


Ser 


Ala 


Asn 
780 


Leu 


Pro 


Ser 


Gly 


Tyr 
785 


Leu 


Gly 


Tyr 


Gin 


Glu 
790 


Leu 


Gly 


Met 


Gly 


Arg 
7 95 


His 


Tyr 


Gly 


Pro 


Glu 
800 


He 


Lys 


Trp 


Val 


Asp 
805 


Gly 


Gly 


Lys 


Pro 


Leu 
810 


Leu 


Lys 


He 


Ser 


Thr 
815 


His 


Leu 


Val 


Ser 


Thr 
820 


Val 


Tyr 


Thr 


Gin 


Asp 
825 


Gin 


His 


Leu 


His 


Asn 
830 


Phe 


Phe 


Gin 


Tyr 


Cys 
835 


Gin 


Lys 


Thr 


Glu 


Ser 
840 


Gly 


Ala 


Gin 


Ala 


Leu 
845 


Gly 


Asn 


Glu 


Leu 


Val 
850 


Lys 


Tyr 


Leu 


Lys 


Ser 
855 


Leu 


His 


Ala 


Met 


Glu 
860 


Gly 


His 


Val 


Met 


He 
865 


Ala 


Phe 


Leu 


Pro 


Thr 
870 


He 


Leu 


Asn 


Gin 


Leu 
875 


Phe 


Arg 


Val 


Leu 


Thr 
880 


Arg 


Ala 


Thr 


Gin 


Glu 
885 


Glu 


Val 


Ala 


Val 


Asn 
890 


Val 


Thr 


Arg 


Val 


He 
895 


He 


His 


Val 


Val 


Ala 
900 


Gin 


Cys 


His 


Glu 


Glu 
905 


Gly 


Leu 


Glu 


Ser 


His 
910 


Leu 


Arg 


Ser 


Tyr 


Val 
915 


Lys 


Tyr 


Ala 


Tyr 


Lys 
920 


Ala 


Glu 


Pro 


Tyr 


Val 
925 


Ala 


Ser 


Glu 


Tyr 


Lys 
930 


Thr 


Val 


His 


Glu 


Glu 
935 


Leu 


Thr 


Lys 


Ser 


Met 
940 


Thr 


Thr 


He 


Leu 


Lys 
945 


Pro 


Ser 


Ala 


Asp 


Phe 
950 


Leu 


Thr 


Ser 


Asn 


Lys 
955 


Leu 


Leu 


Lys 


Tyr 


Ser 
960 


Trp 


Phe 


Phe 


Phe 


Asp 
965 


Val 


Leu 


He 


Lys 


Ser 
970 


Met 


Ala 


Gin 


His 


Leu 
975 


He 


Glu 


Asn 


Ser 


Lys 
980 


Val 


Lys 


Leu 


Leu 


Arg 
985 


Asn 


Gin 


Arg 


Phe 


Pro 
990 


Ala 


Ser 


Tyr 
He 


His 


His 
995 


Ala 


Val 


Glu 


Thr 


Val Val 
1000 


Asn Met Leu Met Pro His 
1005 


Thr 


Gin 
1010 


Lys Phe Arg Asp Asn Pro Glu Ala Ser Lys Asn Ala Asn 
1015 1020 


His 


Ser 
1025 


Leu Ala Val 


Phe He Lys Arg Cys Phe Thr Phe Met Asp 
1030 1035 
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Arg Gly Phe Val Phe Lys G in He Asn Asn Tyr lie ser Cys Phe 

1040 1045 1 

Ala Pro Gly Asp Pro Lys Thr Leu Phe Glu Tyr Lys Phe Glu Phe 

1055 1060 

Leu Arg Val Val Cys Asn His Glu His Tyr lie P ro Leu Asn Leu 

1070 107b 

Pro Met Pro Phe Gly Lys Gl^ Arg He Gin Arg Ty^ Gin Asp Leu 

Gin Leu Asp Tyr Ser Leu Thr Asp Glu Phe Cys Arg Asn His Phe 

1100 1105 
Leu Val Gly Leu Leu Leu Arg Glu Val Gly Thr Ala^ Leu Gin Glu 

1115 112° 1125 

Phe Arg Glu Val Arg Leu lie Ala He Ser Val LeU Q Lys Asn Leu 

1130 llib 
Leu He Lys His Ser Phe Asp Asp Arg Tyr Ala Ser Arg Ser His 

1145 I* 50 llbb 

Gin Ala Arg lie Ala Thr Leu Tyr Leu Pro Leu Phe Gly Leu Leu 

1160 1165 
He Glu Asn Val Gin Arg He Asn Val Arg Asp Val Ser Pro Phe 

1175 1180 
Pro Val Asn Ala Gly Met Thr Val Lys Asp Glu Ser Leu Ala Leu 

1190 1195 
Pro Ala Val Asn Pro Leu Val Thr Pro Gin Lys Gly Ser Thr Leu 

1205 1210 
Asp Asn Ser Leu His Lys Asp Leu Leu Gly Ala lie Ser Gly He 

1220 1225 
Ala Ser Pro Tyr Thr Thr Ser Thr Pro Asn lie Asn Ser Val Arg 

1235 1240 1245 

Asn Ala Asp Ser Arg Gly Ser Leu He Ser Thr AsP Q Ser Gly Asn 

1250 125b 
Ser Leu Pro Glu Arg Asn Ser Glu Lys Ser Asn Ser Leu Asp Lys 

1265 1270 
His Gin Gin Ser Ser Thr Leu Gly Asn Ser Val Val Arg Cys Asp 

1280 I 285 1290 

Lys Leu Asp Gin Ser Glu He Lys Ser Leu Leu Met Cys Phe Leu 

1295 1300 130b 

Tyr He Leu Lys Ser Met Ser Asp Asp Ala Leu Phe Thr Tyr Trp 

1310 13 15 1320 

Asn Lys Ala Ser Thr Ser Glu Leu Met Asp Phe Phe Thr He Ser 

1325 1330 
Glu Val Q Cys Leu His Gin Ph^ Gin Tyr Met Gly LyS Q Arg Tyr He 

Ala Arg Thr Gly Met Met His Ala Arg Leu Gin Gin Leu Gly Ser 
1355 1360 1 
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Leu Asp Asn Ser Leu Thr Phe Asn His Ser Tyr Gly His Ser Asp 
1370 1375 1380 

Ala Asp Val Leu His Gin Ser Leu Leu Glu Ala Asn lie Ala Thr 
1385 1390 1395 

Glu Val Cys Leu Thr Ala Leu Asp Thr Leu Ser Leu Phe Thr Leu 
1400 1405 1410 

Ala Phe Lys Asn Gin Leu Leu Ala Asp His Gly His Asn Pro Leu 
1415 1420 1425 

Met Lys Lys Val Phe Asp Val Tyr Leu Cys Phe Leu Gin Lys His 
1430 1435 1440 

Gin Ser Glu Thr Ala Leu Lys Asn Val Phe Thr Ala Leu Arg Ser 
1445 1450 1455 

Leu lie Tyr Lys Phe Pro Ser Thr Phe Tyr Glu Gly Arg Ala Asp 
1460 1465 1470 

Met Cys Ala Ala Leu Cys Tyr Glu lie Leu Lys Cys Cys Asn Ser 
1475 ' 1480 1485 

Lys Leu Ser Ser lie Arg Thr Glu Ala Ser Gin Leu Leu Tyr Phe 
1490 1495 1500 

Leu Met Arg Asn Asn Phe Asp Tyr Thr Gly Lys Lys Ser Phe Val 
1505 1510 1515 

Arg Thr His Leu Gin Val lie lie Ser Val Ser Gin Leu lie Ala 
1520 1525 1530 

Asp Val Val Gly lie Gly Gly Thr Arg Phe Gin Gin Ser Leu Ser 
1535 1540 1545 

lie lie Asn Asn Cys Ala Asn Ser Asp Arg Leu lie Lys His Thr 
1550 1555 1560 

Ser Phe Ser Ser Asp Val Lys Asp Leu Thr Lys Arg lie Arg Thr 
1565 1570 1575 

Val Leu Met Ala Thr Ala Gin Met Lys Glu His Glu Asn Asp Pro 
1580 1585 1590 

Glu Met Leu Val Asp Leu Gin Tyr Ser Leu Ala Lys Ser Tyr Ala 
1595 1600 1605 

Ser Thr Pro Glu Leu Arg Lys Thr Trp Leu Asp Ser Met Ala Arg 
1610 1615 1620 

lie His Val Lys Asn Gly Asp Leu Ser Glu Ala Ala Met Cys Tyr 
1625 1630 1635 

Val His Val Thr Ala Leu Val Ala Glu Tyr Leu Thr Arg Lys Glu 
1640 1645 1650 

Ala Val Gin Trp Glu Pro Pro Leu Leu Pro His Ser His Ser Ala 
1655 1660 1665 

Cys Leu Arg Arg Ser Arg Gly Gly Val Phe Arg Gin Gly Cys Thr 
1670 1675 1680 

Ala Phe Arg Val lie Thr Pro Asn lie Asp Glu Glu Ala Ser Met 
1685 1690 1695 
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Met Glu Asp Val Gly Met Gin Asp Val His Phe Asn Glu Asp Val 
1700 1705 1710 

Leu Met Glu Leu Leu Glu Gin Cys Ala Asp Gly Leu Trp Lys Ala 
1715 1720 1725 

Glu Arg Tyr Glu Leu lie Ala Asp lie Tyr Lys Leu lie lie Pro 
1730 1735 1740 

lie Tyr Glu Lys Arg Arg Asp Phe Glu Arg Leu Ala His Leu Tyr 
1745 1750 1755 

Asp Thr Leu His Arg Ala Tyr Ser Lys Val Thr Glu Val Met His 
1760 . 1765 1770 

Ser Gly Arg Arg Leu Leu Gly Thr Tyr Phe Arg Val Ala Phe Phe 
1775 1780 1785 

Gly Gin Ala Ala Gin Tyr Gin Phe Thr Asp Ser Glu Thr Asp Val 
1790 1795 1800 

Glu Gly Phe Phe Glu Asp Glu Asp Gly Lys Glu Tyr He Tyr Lys 
1805 1810 1815 

Glu Pro Lys Leu Thr Pro Leu Ser Glu He Ser Gin Arg Leu Leu 
1820 1825 1830 

Lys Leu Tyr Ser Asp Lys Phe Gly Ser Glu Asn Val Lys Met He 
1835 1840 1845 

Gin Asp Ser Gly Lys Val Asn Pro Lys Asp Leu Asp Ser Lys Tyr 
1850 1855 I860 

Ala Tyr He Gin' Val Thr His Val He Pro Phe Phe Asp Glu Lys 
1865 1870 1875 

Glu Leu Gin Glu Arg Lys Thr Glu Phe Glu Arg Ser His Asn He 
1880 1885 1890 

Arg Arg Phe Met Phe Glu Met Pro Phe Thr Gin Thr Gly Lys Arg 
1895 1900 1905 

Gin Gly Gly Val Glu Glu Gin Cys Lys Arg Arg Thr He Leu Thr 
1910 1915 1920 

Ala He His Cys Phe Pro Tyr Val Lys Lys Arg He Pro Val Met 
1925 1930 1935 

Tyr Gin His His Thr Asp Leu Asn Pro He Glu Val Ala He Asp 
1940 1945 1950 

Glu Met Ser Lys Lys Val Ala Glu Leu Arg Gin Leu Cys Ser Ser 
1955 i960 1965 

Ala Glu Val Asp Met He Lys Leu Gin Leu Lys Leu Gin Gly Ser 
1970 1975 1980 

Val Ser Val Gin Val Asn Ala Gly Pro Leu Ala Tyr Ala Arg Ala 
1985 1990 1995 

Phe Leu Asp Asp Thr Asn Thr Lys Arg Tyr Pro Asp Asn Lys Val 
2000 2005 2010 

Lys Leu Leu Lys Glu Val Phe Arg Gin Phe Val Glu Ala Cys Gly 
2015 2020 2025 
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Gin 


Ala 
2030 


Leu 


Ala 


Val 


Asn 


Glu 
2035 


Arg 


Leu 


He 


Lys 


Glu 
2040 


Asp 


Gin 


Leu 


Glu 


Tyr 
2045 


Gin 


Glu 


Glu 


Met 


Lys 
2050 


Ala 


Asn 


Tyr 


Arg 


Glu 
2055 


Met 


Ala 


Lys 


Glu 


Leu 
2060 


Ser 


Glu 


He 


Met 


His 
2065 


Glu 


Gin 


He 


Cys 


Pro 
2070 


Leu 


Glu 


Glu 


Lys 


Thr 
2075 


Ser 


Val 


Leu 


Pro 


Asn 
2080 


Ser 


Leu 


His 


He 


Phe 
2085 


Asn 


Ala 


He 


Ser 


Gly 
2090 


Thr 


Pro 


Thr 


Ser 


Thr 
2095 


Met 


Val 


His 


Gly 


Met 
2100 


Thr 


Ser 


Ser 


Ser 


Ser 
2105 


Val 


Val 

























Table 9 hMXl nucleotide sequence (SEQ ID NO:9) 

ttttgtttac agggaacacg ggtctggctg agagaaaatg gccagcattt tccaagtact 60" 

gtaaattcct gtgcagaagg catcgtcgtc ttccggacag actatggtca ggtattcact 120 

tacaagcaga gcacaattac ccaccagaag gtgactgcta tgcaccccac gaacgaggag 180 

ggcgtggatg acatggcgtc cttgacagag ctccatggcg gctccatcat gtataactta 240 

ttccagcggt ataagagaaa tcaaatatgg acctacatcg gctccatcct ggcctctgtg 300 

aacccctacc agcccatcgc cgggctgtac gagcctgcca ccatggagca gtacagccgg 360 

cgccacctgg gcgagctgcc cccgcacatc ttcgccatcg ccaacgagtg ctaccgctgc 420 

ctgtggaagc gccacgacaa ccagtgcatc ctcatcaagg gtgaaagtgg ggcaggtaaa 480 

accgaaagca ctaaattgat cctcaagttt ctgtcagtca tcagtcaaca gtctttggaa 540 

ttgtccttaa aggagaagac atcctgtgtt gaacgagcta ttcttgaaag cagccccatc 600 

atggaagctt tcggcaatgc -gaagaccgtg tacaacaaca actctagtcg ctttgggaag 660 

tttgttcagc tgaacatctg tcagaaagga aatattcagg gcgggagaat tgtagattgt 720 

atcctctctt cccagaaccg agtagtaagg caaaatcccg gggaaaggaa ttatcacata 780 

ttttatgcac tgctggcagg gctggaacat gaagaaagag aagaatttta tttatctacg 840 

ccagaaaact accactactt gaatcagtct ggatgtgtag aagacaagac aatcagtgac 900 

caggaatcct ttagggaagt tattacggca atggacgtga tgcagttcag caaggaggaa 960 

gttcgggaag tgtcgaggct gcttgctggt -atactgcatc ttgggaacat agaatttatc 1020 

actgctggtg gggcacaggt ttccttcaaa acagctttgg gcagatctgc ggagttactt 1080 

gggctggacc caacacagct cacagatgct ttgacccaga gatcaatgtt cctcagggga 1140 

gaagagatcc tcacgcctct caatgttcaa caggcagtag acagcaggga ctccctggcc 1200 

atggctctgt atgcgtgctg ctttgagtgg gtaatcaaga agatcaacag caggatcaaa 1260 

ggcaatgagg acttcaagtc tattggcatc ctcgacatct ttggatttga aaactttgag 1320 

gttaatcact ttgaacagtt caatataaac tatgcaaacg agaaacttca ggagtacttc 1380 

aacaagcata ttttttcttt agaacaacta gaatatagca gggaaggatt agtgtgggaa 1440 

gatattgact ggatagacaa tggagaatgc ctggacttga ttgagaagaa acttggcctc 1500 

ctagccctta tcaatgaaga aagccatttt cctcaagcca cagacagcac cttattggag 1560 

aagctacaca gtcagcatgc gaataaccac ttttatgtga agcccagagt tgcagttaac 1620 
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aattttggag tgaagcacta tgctggagag gtgcaatatg atgtccgagg tatcttggag 1680 

aagaacagag atacatttcg agatgacctt ctcaatttgc taagagaaag ccggtttgac 1740 

tttatctacg atctttttga acatgtttca agccgcaaca accaggatac cttgaaatgt 1800 

ggaagcaaac atcggcggcc tacagtcagc tcacagttca aggttgactc actgcattcc 1860 

ttaatggcaa cgctaagctc ctctaatcct ttctttgttc gctgtatcaa gccaaacatg 1920 

cagaagatgc cagaccagtt tgaccaggcg gttgtgctga accagctgcg gtactcaggg 1980 

atgctggaga ctgtgagaat ccgcaaagct gggtatgcgg tccgaagacc ctttcaggac 2040 

ttttacaaaa ggtataaagt gctgatgagg aatctggctc tgcctgagga. cgtccgaggg 2100 

aagtgcacga gcctgctgca gctctatgat gcctccaaca gcgagtggca gctggggaag 2160 

accaaggtat ttcttcgaga atccttggaa cagaaactgg agaagcggag ggaagaggaa 222 0 

gtgagccacg cggccatggt gattcgggcc catgtcttgg gcttcttagc acggaaacaa 2280 

tacagaaagg tcctttattg tgtggtgata atacagaaga attacagagc attccttctg 2340 

aggaggagat ttttgcacct gaaaaaggca gccatagttt tccagaagca actcagaggt 2400 

cagattgctc ggagagttta cagacaattg ctggcagaga aaagggagca agaagaaaag 2460 

aagaaacagg aagaggaaga aaagaagaaa cgggaggaag aagaaagaga aagagagaga 2 52 0 

gagcgaagag aagccgagct ccgcgcccag caggaagaag aaacgaggaa gcagcaagaa 2 58 0 

ctcgaagcct tgcagaagag ccagaaggaa gctgaactga cccgtgaact ggagaaacag 2 64 0 

aaggaaaata agcaggtgga agagatcctc cgtctggaga aagaaatcga ggacctgcag 2700 

cgcatgaagg agcagcagga gctgtcgctg accgaggctt ccctgcagaa gctgcaggag 2760 

cggcgggacc aggagctccg caggctggag gaggaagcgt gcagggcggc ccaggagttc 282 0 

ctcgagtccc tcaatttcga cgagatcgac gagtgtgtcc ggaatatcga gcggtccctg 2880 

tcggggggaa gcgaattttc cagcgagctg gctgagagcg catgcgagga gaagcccaac 2940 

ttcaacttca gccagcccta cccagaggag gaggtcgatg agggcttcga agccgacgac 3000 

gacgccttca aggactcccc caaccccagc gagcacggcc actcagacca gcgaacaagt 3060 

ggcatccgga ccagcgatga ctcttcagag gaggacccat acatgaacga cacggtggtg 312 0 

cccaccagcc ccagtgcgga cagcacggtg ctgctcgccc catcagtgca ggactccggg 3180 

agcctacaca actcctccag cggcgagtcc acctactgca tgccccagaa cgctggggac 3240 

ttgccctccc cagacggcga ctacgactac gaccaggatg actatgagga cggtgccatc 3300 

acttccggca gcagcgtgac cttctccaac tcctacggca gccagtggtc ccccgactac 3360 

cgctgctctg tggggaccta caacagctcg ggtgcctacc ggttcagctc tgagggggcg 342 0 

cagtcctcgt ttgaagatag tgaagaggac tttgattcca ggtttgatac agatgatgag 3480 

ctttcatacc ggcgtgactc tgtgtacagc tgtgtcactc tgccgtattt ccacagcttt 3540 

ctgtacatga aaggtggcct gatgaactct tggaaacgcc gctggtgcgt cctcaaggat 3600 

gaaaccttct tgtggttccg ctccaagcag gaggccctca agcaaggctg gctccacaaa 3 660 

aaaggggggg gctcctccac gctgtccagg agaaattgga agaagcgctg gtttgtcctc 372 0 

cgccagtcca agctgatgta ctttgaaaac gacagcgagg agaagctcaa gggcaccgta 37 80 

gaagtgcgaa cggcaaaaga gatcatagat aacaccacca aggagaatgg gatcgacatc 3 840 

attatggccg ataggacttt ccacctgatt gcagagtccc cagaagatgc cagccagtgg 3 900 

ttcagcgtgc tgagtcaggt ccacgcgtcc acggaccagg agatccagga gatgcatgat 3 960 

gagcaggcaa acccacagaa tgctgtgggc accttggatg tggggctgat tgattctgtg 402 0 

tgtgcctctg acagccctga tagacccaac tcgtttgtga tcatcacggc caaccgggtg 4080 

ctgcactgca acgccgacac gccggaggag atgcaccact ggataaccct gctgcagagg 4140 

tccaaagggg acaccagagt ggagggccag gaattcatcg tgagaggatg gttgcacaaa 4200 

gaggtgaaga acagtccaaa gatgtcttca ctgaaactga agaaacggtg gtttgtactc 4260 
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acccacaatt ccctggatta ctacaagagt tcagagaaga acgcgctcaa actggggacc 
ctggtcctca acagcctctg ctctgtcgtc cccccagatg agaagatatt caaagagaca 
ggctactgga acgtcaccgt gtacgggcgc aagcactgtt accggctcta caccaagctg 
ctcaacgagg ccacccggtg gtccagtgtc attcaaaacg tgactgacac caaggccccg 
atcgacaccc ccacccagca gctgattcaa gatatcaagg agaactgcct gaactcggat 
gtggtggaac agatttacaa gcggaacccg atccttcgat acacccatca ccccttgcac 
tccccgctcc tgccccttcc gtatggggac ataaatctca acttgctgaa agacaaaggc 
tataccaccc ttcaggatga ggccatcaag atattcaatt ccctgcagca actggagtcc 
atgtctgacc caattccaat aatccagggc atcctacaga cagggcatga cctgcgacct 
ctgcgggacg agctgtactg ccagcttatc aaacagacca acaaagtgcc ccaccccggc 
agtgtgggca acctgtacag ctggcagatc ctgacatgcc tgagctgcac cttcctgccg 
agtcgaggga ttctcaagta tctcaagttc catctgaaaa ggatacggga acagtttcca 
ggaaccgaga tggaaaaata cgctctcttc acttacgaat ctcttaagaa aaccaaatgc 
cgagagtttg tgccttcccg agatgaaata gaagctctga tccacaggca ggaaatgaca 
tccacggtct attgccatgg cggcggctcc tgcaagatca ccatcaactc ccacaccacc 
gctggggagg tggtggagaa gctgatccga ggcctggcca tggaggacag caggaacatg 
tttgctttgt ttgaatacaa cggccacgtc gacaaagcca ttgaaagtcg aaccgtcgta 
gctgatgtct tagccaagtt tgaaaagctg gctgccacat ccgaggttgg ggacctgcca 
tggaaattct acttcaaact ttactgcttc ctggacacag acaacgtgcc aaaagacagt 
gtggagtttg catttatgtt tgaacaggcc cacgaagcgg ttatccatgg ccaccatcca 
gccccggaag aaaacctcca ggttcttgct gccctgcgac tccagtatct gcagggggat 
tatactctgc acgctgccat cccacctctc gaagaggttt attccctgca gagactcaag 
gcccgcatca gccagtcaac caaaaccttc accccttgtg aacggctgga gaagaggcgg 
acgagcttcc tagaggggac cctgaggcgg agcttccgga caggatccgt ggtccggcag 
aaggtcgagg aggagcagat gctggacatg tggattaagg aagaagtctc ctctgctcga 
gccagtatca ttgacaagtg gaggaaattt cagggaatga accaggaaca ggccatggcc 
aagtacatgg ccttgatcaa ggagtggcct ggctatggct cgacgctgtt tgatgtggag 
tgcaaggaag gtggcttccc tcaggaactc tggttgggtg tcagcgcgga cgccgtctcc 
gtctacaagc gtggagaggg aagaccactg gaagtcttcc agtatgaaca catcctctct 
tttggggcac ccctggcgaa tacgtataag atcgtggtcg atgagaggga gctgctcttt 
gaaaccagtg aggtagtgga tgtggccaag ctcatgaaag cctacatcag catgatcgtg 
aagaagcgct acagcacgac acgctccgcc agcagccagg gcagctccag g 



4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6171 



Table 10 . hMXl polypeptide sequence (SEQ ID NO: 10) 



Phe 
1 


Cys 


Leu 


Gin 


Gly 
5 


Thr 


Arg 


Val 


Trp 


Leu 
10 


Arg 


Glu 


Asn 


Gly 


Gin 
15 


His 


Phe 


Pro 


Ser 


Thr 
20 


Val 


Asn 


Ser 


Cys 


Ala 
25 


Glu 


Gly 


He 


Val 


Val 
30 


Phe 


Arg 


Thr 


Asp 


Tyr 
35 


Gly 


Gin 


Val 


Phe 


Thr 
40 


Tyr 


Lys 


Gin 


Ser 


Thr 
45 


He 


Thr 


His 


Gin 


Lys 


Val 


Thr 


Ala 


Met 


His 


Pro 


Thr 


Asn 


Glu 


Glu 


Gly 


Val 


Asp 


Asp 
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50 










55 










60 










Met 
65 


Ala 


Ser 


Leu 


Thr 


Glu 
70 


Leu 


His 


Gly 


Gly 


Ser 
75 


He 


Met 


Tyr 


Asn 


Leu 
80 


Phe 


Gin 


Arg 


Tyr 


Lys 
85 


Arg 


Asn 


Gin 


He 


Trp 
90 


Thr 


Tyr 


He 


Gly 


Ser 
95 


He 


Leu 


Ala 


Ser 


Val 
100 


Asn 


Pro 


Tyr 


Gin 


Pro 
105 


He 


Ala 


Gly 


Leu 


Tyr 
110 


Glu 


Pro 


Ala 


Thr 


Met 
115 


Glu 


Gin 


Tyr 


Ser 


Arg 
120 


Arg 


His 


Leu 


Gly 


Glu 
125 


Leu 


Pro 


Pro 


His 


He 
130 


Phe 


Ala 


He 


Ala 


Asn 
135 


Glu 


Cys 


Tyr 


Arg 


Cys 
140 


Leu 


Trp 


Lys 


Arg 


His 
145 


Asp 


Asn 


Gin 


Cys 


He 
150 


Leu 


He 


Lys 


Gly 


Glu 
155 


Ser 


Gly 


Ala 


Gly 


Lys 
160 


Thr 


Glu 


Ser 


Thr 


Lys 
165 


Leu 


He 


Leu 


Lys 


Phe 
170 


Leu 


Ser 


Val 


He 


Ser 
175 


Gin 


Gin 


Ser 


Leu 


Glu 
180 


Leu 


Ser 


Leu 


Lys 


Glu 
185 


Lys 


Thr 


Ser 


Cys 


Val 
190 


Glu 


Arg 


Ala 


He 


Leu 
195 


Glu 


Ser 


Ser 


Pro 


He 
200 


Met 


Glu 


Ala 


Phe 


Gly 
205 


Asn 


Ala 


Lys 


Thr 


Val 
210 


Tyr 


Asn 


Asn 


Asn 


Ser 
215 


Ser 


Arg 


Phe 


Gly 


Lys 
220 


Phe 


Val 


Gin 


Leu 


Asn 
225 


He 


Cys 


Gin 


Lys 


Gly 
230 


Asn 


He 


Gin 


Gly 


Gly 
235 


Arg 


He 


Val 


Asp 


Cys 
240 


He 


Leu 


Ser 


Ser 


Gin 
245 


Asn 


Arg 


Val 


Val 


Arg 
250 


Gin 


Asn 


Pro 


Gly 


Glu 
255 


Arg 


Asn 


Tyr 


His 


He 
260 


Phe 


Tyr 


Ala 


Leu 


Leu 
265 


Ala 


Gly 


Leu 


Glu 


His 
270 


Glu 


Glu 


Arg 


Glu 


Glu 
275 


Phe 


Tyr 


Leu 


Ser 


Thr 
280 


Pro 


Glu 


Asn 


Tyr 


His 
285 


Tyr 


Leu 


Asn 


Gin 


Ser 
290 


Gly 


Cys 


Val 


Glu 


Asp 
295 


Lys 


Thr 


He 


Ser 


Asp 
300 


Gin 


Glu 


Ser 


Phe 


Arg 
305 


Glu 


Val 


He 


Thr 


Ala 
310 


Met 


Asp 


Val 


Met 


Gin 
315 


Phe 


Ser 


Lys 


Glu 


Glu 
320 


Val 


Arg 


Glu 


Val 


Ser 
325 


Arg 


Leu 


Leu 


Ala 


Gly 
330 


He 


Leu 


His 


Leu 


Gly 
335 


Asn 


He 


Glu 


Phe 


lie 
340 


Thr 


Ala 


Gly 


Gly 


Ala 
345 


Gin 


Val 


Ser 


Phe 


Lys 
350 


Thr 


Ala 


Leu 


Gly 


Arg 
355 


Ser 


Ala 


Glu 


Leu 


Leu 
360 


Gly 


Leu 


Asp 


Pro 


Thr 
365 


Gin 


Leu 


Thr 


Asp 


Ala 
370 


Leu 


Thr 


Gin 


Arg 


Ser 
375 


Met 


Phe 


Leu 


Arg 


Gly 
380 


Glu 


Glu 


He 


Leu 


Thr 
385 


Pro 


Leu 


Asn 


Val 


Gin 
390 


Gin 


Ala 


Val 


Asp 


Ser 
395 


Arg 


Asp 


Ser 


Leu 


Ala 
400 


Met 


Ala 


Leu 


Tyr 


Ala 


Cys 


Cys 


Phe 


Glu 


Trp 


Val 


He 


Lys 


Lys 


He 


Asn 
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405 










410 










415 




Ser 


Arg 


He 


Lys 
420 


Gly 


Asn 


Glu 


Asp 


Phe 
425 


Lys 


Ser 


He 


Gly 


He 
430 


Leu 


Asp 


He 


Phe 


Gly 
435 


Phe 


Glu 


Asn 


Phe 


Glu 
440 


Val 


Asn 


His 


Phe 


Glu 
445 


Gin 


Phe 


Asn 


He 


Asn 
450 


Tyr 


Ala 


Asn 


Glu 


Lys 
455 


Leu 


Gin 


Glu 


Tyr 


Phe 
460 


Asn 


Lys 


His 


He 


Phe 
465 


Ser 


Leu 


Glu 


Gin 


Leu 
470 


Glu 


Tyr 


Ser 


Arg 


Glu 
475 


Gly 


Leu 


Val 


Trp 


Glu 
480 


Asp 


He 


Asp 


Trp 


He 
485 


Asp 


Asn 


Gly 


Glu 


Cys 
490 


Leu 


Asp 


Leu 


He 


Glu 
495 


Lys 


Lys 


Leu 


Gly 


Leu 
500 


Leu 


Ala 


Leu 


He 


Asn 
505 


Glu 


Glu 


Ser 


His 


Phe 
510 


Pro 


Gin 


Ala 


Thr 


Asp 
515 


Ser 


Thr 


Leu 


Leu 


Glu 
520 


Lys 


Leu 


His 


Ser 


Gin 
525 


His 


Ala 


Asn 


Asn 


His 
530 


Phe 


Tyr 


Val 


Lys 


Pro 
535 


Arg 


Val 


Ala 


Val 


Asn 
540 


Asn 


Phe 


Gly 


Val 


Lys 
545 


His 


Tyr 


Ala 


Gly 


Glu 
550 


Val 


Gin 


Tyr 


Asp 


Val 
555 


Arg 


Gly 


He 


Leu 


Glu 
560 


Lys 


Asn 


Arg 


Asp 


Thr 
565 


Phe 


Arg 


Asp 


Asp 


Leu 
570 


Leu 


Asn 


Leu 


Leu 


Arg 
575 


Glu 


Ser 


Arg 


Phe 


Asp 
580 


Phe 


He 


Tyr 


Asp 


Leu 
585 


Phe 


Glu 


His 


Val 


Ser 
590 


Ser 


Arg 


Asn 


Asn 


Gin 
595 


Asp 


Thr 


Leu 


Lys 


Cys 
600 


Gly 


Ser 


Lys 


His 


Arg 
605 


Arg 


Pro 


Thr 


Val 


Ser 
610 


Ser 


Gin 


Phe 


Lys 


Val 
615 


Asp 


Ser 


Leu 


His 


Ser 
620 


Leu 


Met 


Ala 


Thr 


Leu 
625 


Ser 


Ser 


Ser 


Asn 


Pro 
630 


Phe 


Phe 


Val 


Arg 


Cys 
635 


He 


Lys 


Pro 


Asn 


Met 
640 


Gin 


Lys 


Met 


Pro 


Asp 
645 


Gin 


Phe 


Asp 


Gin 


Ala 
650 


Val 


Val 


Leu 


Asn 


Gin 
655 


Leu 


Arg 


Tyr 


Ser 


Gly 
660 


Met 


Leu 


Glu 


Thr 


Val 
665 


Arg 


He 


Arg 


Lys 


Ala 
670 


Gly 


Tyr 


Ala 


Val 


Arg 
675 


Arg 


Pro 


Phe 


Gin 


Asp 
680 


Phe 


Tyr 


Lys 


Arg 


Tyr 
685 


Lys 


Val 


Leu 


Met 


Arg 
690 


Asn 


Leu 


Ala 


Leu 


Pro 
695 


Glu 


Asp 


Val 


Arg 


Gly 
700 


Lys 


Cys 


Thr 


Ser 


Leu 
705 


Leu 


Gin 


Leu 


Tyr 


Asp 
710 


Ala 


Ser 


Asn 


Ser 


Glu 
715 


Trp 


Gin 


Leu 


Gly 


Lys 
720 


Thr 


Lys 


Val 


Phe 


Leu 
725 


Arg 


Glu 


Ser 


Leu 


Glu 
730 


Gin 


Lys 


Leu 


Glu 


Lys 
735 


Arg 


Arg 


Glu 


Glu 


Glu 
740 


Val 


Ser 


His 


Ala 


Ala 
745 


Met 


Val 


He 


Arg 


Ala 
750 


His 


Val 


Leu 


Gly 


Phe 


Leu 


Ala 


Arg 


Lys 


Gin 


Tyr 


Arg 


Lys 


Val 


Leu 


Tyr 


Cys 


Val 
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755 760 


765 








Val 


lie lie Gin Lys Asn Tyr Arg Ala Phe Leu Leu Arg 
770 775 780 


Arg 


Arg 


Phe 


Leu 
785 


His Leu Lys Lys Ala Ala lie Val Phe Gin Lys Gin 
790 795 


Leu 


Arg 


Gly 
800 


Gin 


lie Ala Arg Arg Val Tyr Arg Gin Leu Leu Ala Glu 
805 810 


Lys 


Arg 
815 


Glu 


Gin 


Glu Glu Lys Lys Lys Gin Glu Glu Glu Glu Lys Lys 
820 825 


Lys 
830 


Arg 


Glu 


Glu 


Glu Glu Arg Glu Arg Glu Arg Glu Arg Arg Glu Ala 
835 840 845 


Glu 


Leu 


Arg 


Ala 


Gin Gin Glu Glu Glu Thr Arg Lys Gin Gin Glu Leu 
850 855 860 


Glu 


Ala 


Leu 


Gin 
865 


Lys Ser Gin Lys Glu Ala Glu Leu Thr Arg Glu Leu 
870 875 


Glu 


Lys 


Gin 
880 


Lys 


Glu Asn Lys Gin Val Glu Glu lie Leu Arg Leu Glu 
885 890 


Lys 


Glu 
895 


He 


Glu 


Asp Leu Gin Arg Met Lys Glu Gin Gin Glu Leu Ser 
900 905 


Leu 
910 


Thr 


Glu 


Ala 


Ser Leu Gin Lys Leu Gin Glu Arg Arg Asp Gin Glu 
915 920 925 


Leu 


Arg 


Arg 


Leu 


Glu Glu Glu Ala Cys Arg Ala Ala Gin Glu Phe Leu 
930 935 940 


Glu 


Ser 


Leu 


Asn 
945 


Phe Asp Glu lie Asp Glu Cys Val Arg Asn lie Glu 
950 955 


Arg 


Ser 


Leu 
960 


Ser 


Gly Gly Ser Glu Phe Ser Ser Glu Leu Ala Glu Ser 
965 970 


Ala 


Cys 
975 


Glu 


Glu 


Lys Pro Asn Phe Asn Phe Ser Gin Pro Tyr Pro Glu 
980 985 


Glu 
990 


Glu 


Val 


Asp 


Glu Gly Phe Glu Ala Asp Asp Asp Ala Phe Lys Asp Ser 
995 1000 1005 


Pro 


Asn 


Pro 


Ser Glu His Gly His Ser Asp Gin Arg Thr 
1010 1015 


Ser 
1020 


Gly 


He 


Arg 


Thr 


Ser Asp Asp Ser Ser Glu Glu Asp Pro Tyr 
1025 1030 


Met 
1035 


Asn 


Asp 


Thr 


Val 


Val Pro Thr Ser Pro Ser Ala Asp Ser Thr 
ltf40 1045 


Val 
1050 


Leu 


Leu 


Ala 


Pro 


Ser Val Gin Asp Ser Gly Ser Leu His Asn 
1055 1060 


Ser 
1065 


Ser 


Ser 


Gly 


Glu 


Ser Thr Tyr Cys Met Pro Gin Asn Ala Gly 
1070 1075 


Asp 
1080 


Leu 


Pro 


Ser 


Pro 


Asp Gly Asp Tyr Asp Tyr Asp Gin Asp Asp 
1085 1090 


Tyr 
1095 


Glu 


Asp 


Gly 


Ala 


lie Thr Ser Glv Ser Ser Val Thr Phe Ser 


Asn 


Ser 


Tyr 


Glv 
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noo nog nio 

Ser Gin Trp Ser Pro Asp Ty^ Arg Cys Ser Val Gly. Thr Tyr Asn 



1115 



Ser Ser Gly Ala Tyr Arg Phe Ser Ser Glu Gly Ala Gin Ser Ser 

1130 1135 
Phe Glu Asp Ser Glu Glu Asp Phe Asp Ser Arg Phe Asp Thr Asp 

1145 ll 50 
Asp G1U q Leu Ser Tyr ArgArg^ Asp Ser Val Tyr Se^ Cys Val Thr 

Leu Pro Tyr Phe His Ser Phe Leu Tyr Met Lys Gly Gly Leu Met 

1180 1103 



1175 



Asn Ser Trp Lys Arg Arg Trp Cys Val Leu Lys Asp Glu Thr Phe 

1190 II 95 1200 

Leu Trp Phe Arg Ser Lys Gin Glu Ala Leu Lys Gin Gly Trp Leu 
1205 1210 

His Lys Lys Gly Gly Gly Ser Ser Thr Leu Ser Arg Arg Asn Trp 

1225 



1220 



Lys Lys Arg Trp Phe Val LeU Q Arg Gin Ser Lys Le^ Met Tyr Phe 

Glu Asn Asp Ser Glu Glu Lys Leu Lys Gly Thr Val Glu Val Arg 

1250 1255 

Thr Ala Lys Glu He He Asp Asn Thr Thr Lys Glu Asn Gly He 
1265 1270 1275 

Asp He He Met Ala Asp Arg Thr Phe His Leu lie Ala Glu Ser 

1280 1285 
Pro Glu s Asp Ala Ser Gin Trp^ Phe Ser Val Leu Se^ Gin Val His 

Ala Ser Thr Asp Gin Glu He Gin Glu Met His Asp Glu Gin Ala 
1310 1315 I 320 

Asn Pro Gin Asn Ala Val Gly Thr, Leu Asp Val Gly Leu He Asp 
1325 1330 1335 

Ser Val Cys Ala Ser Asp Ser Pro Asp Arg Pro Asn Ser Phe Val 
1340 1345 1350 

He lie Thr Ala Asn Arg Val Leu His Cys Asn Ala Asp Thr Pro 
1355 1360 1365 

Glu Glu Met His His Trp lie Thr Leu Leu Gin Arg^ Ser Lys Gly 

Asp Thr Arg Val Glu Gly Gin Glu Phe He Val Arg Gly Trp Leu 

1385 1390 

His Lys Glu Val Lys Asn Ser Pro Lys Met Ser Ser Leu Lys Leu 

1400 1405 

Lys Lys Arg Trp Phe Val Leu Thr His Asn Ser Leu Asp Tyr Tyr 

1415 1420 
T. yR Ser Ser m„ Lvs Asn A l* Leu Lys Leu Gly Thr Leu Val Leu 
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1430 










1435 










1440 








Asn 


Ser 
1445 


Leu 


Cys 


Ser 


Val 


Val 
1450 


Pro 


Pro 


Asp 


Glu 


Lys 
1455 


He 


Phe 


Lys 


Glu 


Thr 
1460 


Gly 


Tyr 


Trp 


Asn 


Val 
1465 


Thr 


Val 


Tyr 


Gly 


Arg 
1470 


Lys 


His 


Cys 


Tyr 


Arg 
1475 


Leu 


Tyr 


Thr 


Lys 


Leu 
1480 


Leu 


Asn 


Glu 


Ala 


Thr 
1485 


Arg 


Trp 


Ser 


Ser 


Val 
1490 


He 


Gin 


Asn 


Val 


Thr 
1495 


Asp 


Thr 


Lys 


Ala 


Pro 
1500 


He 


Asp 


Thr 


Pro 


Thr 
1505 


Gin 


Gin 


Leu 


He 


Gin 
1510 


Asp 


He 


Lys 


Glu 


Asn 
1515 


Cys 


Leu 


Asn 


Ser 


Asp 
1520 


Val 


Val 


Glu 


Gin 


He 
1525 


Tyr 


Lys 


Arg 


Asn 


Pro 
1530 


He 


Leu 


Arg 


Tyr 


Thr 
1535 


His 


His 


Pro 


Leu 


His 
1540 


Ser 


Pro 


Leu 


Leu 


Pro 
1545 


Leu 


Pro 


Tyr 


Gly 


Asp 
1550 


He 


Asn 


Leu 


Asn 


Leu 
1555 


Leu 


Lys 


Asp 


Lys 


Gly 
1560 


Tyr 


Thr 


Thr 


Leu 


Gin 
1565 


Asp 


Glu 


Ala 


He 


Lys 
1570 


He 


Phe 


Asn 


Ser 


Leu 
1575 


Gin 


Gin 


Leu 


Glu 


Ser 
1580 


Met 


Ser 


Asp 


Pro 


He 
1585 


Pro 


lie 


He 


Gin 


Gly 
1590 


He 


Leu 


Gin 


Thr 


Gly 
1595 


His 


Asp 


Leu 


Arg 


Pro 
1600 


Leu 


Arg 


Asp 


Glu 


Leu 
1605 


Tyr 


Cys 


Gin 


Leu 


He 
1610 


Lys 


Gin 


Thr 


Asn 


Lys 
1615 


Val 


Pro 


His 


Pro 


Gly 
1620 


Ser 


Val 


Gly 


Asn 


Leu 
1625 


Tyr 


Ser 


Trp 


Gin 


He 
1630 


Leu 


Thr 


Cys 


Leu 


Ser 
1635 


Cys 


Thr 


Phe 


Leu 


Pro 
1640 


Ser 


Arg 


Gly 


He 


Leu 
1645 


Lys 


Tyr 


Leu 


Lys 


Phe 
1650 


His 


Leu 


Lys 


Arg 


He 
1655 


Arg 


Glu 


Gin 


Phe 


Pro 
1660 


Gly 


Thr 


Glu 


Met 


Glu 
1665 


Lys 


Tyr 


Ala 


Leu 


Phe 
1670 


Thr 


Tyr 


Glu 


Ser 


Leu 
1675 


Lys 


Lys 


Thr 


Lys 


Cys 
1680 


Arg 


Glu 


Phe 


Val 


Pro 
1685 


Ser 


Arg 


Asp 


Glu 


He 
1690 


Glu 


Ala 


Leu 


He 


His 
1695 


Arg 


Gin 


Glu 


Met 


Thr 
1700 


Ser 


Thr 


Val 


Tyr 


Cys 
1705 


His 


Gly 


Gly 


Gly 


Ser 
1710 


Cys 


Lys 


He 


Thr 


He 
1715 


Asn 


Ser 


His 


Thr 


Thr 
1720 


Ala 


Gly 


Glu 


Val 


Val 
1725 


Glu 


Lys 


Leu 


He 


Arg 
1730 


Gly 


Leu 


Ala 


Met 


Glu 
1735 


Asp 


Ser 


Arg 


Asn 


Met 
1740 


Phe 


Ala 


Leu 


Phe 


Glu ' 
1745 


Tyr 


Asn 


Gly 


His 


Val 
1750 


Asp 


Lys 


Ala 


He 


Glu 
1755 


Ser 


Arg 


Thr 


Val 


Val 


Ala 


Asp 


Val 


Leu 


Ala 


Lys 


Phe 


Glu 


Lys 


Leu 


Ala 


Ala 


Thr 
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1760 










1765 










1770 








Ser 


Glu 
1775 


Val 


Gly 


Asp 


Leu 


Pro 
1780 


Trp 


Lys 


Phe 


Tyr 


Phe 
1785 


Lys 


Leu 


Tyr 


Cys 


Phe 
1790 


Leu 


Asp 


Thr 


Asp 


Asn 
1795 


Val 


Pro 


Lys 


Asp 


Ser 
1800 


Val 


Glu 


Phe 


Ala 


Phe 
1805 


Met 


Phe 


Glu 


Gin 


Ala 
1810 


His 


Glu 


Ala 


Val 


He 
1815 


His 


Gly 


His 


His 


Pro 
1820 


Ala 


Pro 


Glu 


Glu 


Asn 
1825 


Leu 


Gin 


Val 


Leu 


Ala 
1830 


Ala 


Leu 


Arg 


Leu 


Gin 
1835 


Tyr 


Leu 


Gin 


Gly 


Asp 
1840 


Tyr 


Thr 


Leu 


His 


Ala 
1845 


Ala 


He 


Pro 


Pro 


Leu 
1850 


Glu 


Glu 


Val 


Tyr 


Ser 
1855 


Leu 


Gin 


Arg 


Leu 


Lys 
1860 


Ala 


Arg 


He 


Ser 


Gin 
1865 


Ser 


Thr 


Lys 


Thr 


Phe 
1870 


Thr 


Pro 


Cys 


Glu 


Arg 
1875 


Leu 


Glu 


Lys 


Arg 


Arg 
1880 


Thr 


Ser 


Phe 


Leu 


Glu 
1885 


Gly 


Thr 


Leu 


Arg 


Arg 
1890 


Ser 


Phe 


Arg 


Thr 


Gly 
1895 


Ser 


Val 


Val 


Arg 


Gin 
1900 


Lys 


Val 


Glu 


Glu 


Glu 
1905 


Gin 


Met 


Leu 


Asp 


Met 
1910 


Trp 


He 


Lys 


Glu 


Glu 
1915 


Val 


Ser 


Ser 


Ala 


Arg 
1920 


Ala 


Ser 


He 


lie 


Asp 
1925 


Lys 


Trp 


Arg 


Lys 


Phe 
1930 


Gin 


Gly 


Met 


Asn 


Gin 
1935 


Glu 


Gin 


Ala 


Met 


Ala 
1940 


Lys 


Tyr 


Met 


Ala 


Leu 
1945 


He 


Lys 


Glu 


Trp 


Pro 
1950 


Gly 


Tyr 


Gly 


Ser 


Thr 
1955 


Leu 


Phe 


Asp 


Val 


Glu 
1960 


Cys 


Lys 


Glu 


Gly 


Gly 
1965 


Phe 


Pro 


Gin 


Glu 


Leu 
1970 


Trp 


Leu 


Gly 


Val 


Ser 
1975 


Ala 


Asp 


Ala 


Val 


Ser 
1980 


Val 


Tyr 


Lys 


Arg 


Gly 
1985 


Glu 


Gly 


Arg 


Pro 


Leu 
1990 


Glu 


Val 


Phe 


Gin 


Tyr 
1995 


Glu 


His 


He 


Leu 


Ser 
2000 


Phe 


Gly 


Ala 


Pro 


Leu 
2005 


Ala 


Asn 


Thr 


Tyr 


Lys 
2010 


He 


Val 


Val 


Asp 


Glu 
2015 


Arg 


Glu 


Leu 


Leu 


Phe 
2020 


Glu 


Thr 


Ser 


Glu 


Val 
2025 


Val 


Asp 


Val 


Ala 


Lys 
2030 


Leu 


Met 


Lys 


Ala 


Tyr 
2035 


He 


Ser 


Met 


He 


Val 
2040 


Lys 


Lys 


Arg 


Tyr 


Ser 
2045 


Thr 


Thr 


Arg 


Ser 


Ala 
2050 


Ser 


Ser 


Gin 


Gly 


Ser 
2055 


Ser 


Arg 





Table 11 hMX2 nucleotide sequence (SEQ ID NO: 1 1) 
tagtatc ttttattgtc agaacttctg tgagccaaca aacagttttg catggttgta 
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cacaaaggga caaggcaaat ttc ttttttc gtgtgggtag acttagttgg cccaagtcct 
taaaactttt ccatataaaa ataaaaagtc caagaccaga ttatttttct tctggtcata 
aatgctgatt tatttacagg tgccttgttc agaccaccat tataaacttg ggataaaata 
tgtgtgtatt aaagcctcag catttaatgt cagggtcctt tgaagattca ctcaagtgtt 
aagacgtttc tggaatgcag cgtctctccc ccatagtcaa catggttatt atatctgtaa 
tctatccaga atgatagaag ctaaccttcc aagtaacact ttgtttttaa cttaaatctt 
ttagacatga aagactccaa aatgacttca ttcttgttct aaaaccagca ctggagccag 
ctgttgaaga gtggtttata aatacagtta tcttgtaggc tgcttatctg tttataatac 
agcagacaca gatggcagac tttgctacat gtaaaacaat ggagtcaaca cgtgtttttc 
aaaatacagc aaagacagga aaatccagga tttgggtttg ttaataaaac caccttataa 
agtaacaatt gagactatag ctctgcatta ttaaaatata cagactgtgt acaccattac 
acatcctttt tccctttgct ttttaatgct catgaaacca tgattaaggt gttgagttta 
tgaacacatg cacgaacagg caagcacgta cacttaaaag atgaaacaaa gaaaaaagtt 
gattcatgtc attccatgag aaaggctgcc cgcagcactc cagctcaaac acactgtccc 
ctcgagctct ccatccccct tcccactccc tcaccttccc tcagattcgg ggaaatcagg 
ttgggaggtt agtgcatcat tgacagagaa tgcccccctt ccacgctctg ttaagtctcc 
cccagaaggg ggaaaggcag ttcccttcag tagcacagtt acggtcgatt agtgttggtt 
ccacaagtta aggcacttcc ggctgctttg gtggcagcgt ggttcctccc ctcctttttt 
aaggcatgtg tcctctaaga gtagtaaagc tttggaaact gtgcagactg ttaaagttga 
cagcttaata caggatcaat gaaggcggca ggcaaaagga tcctcggaga cacctccctc 
agaccagaag cttccagaaa gcctgggcag ctctgtgttt gttttggctg ggcatggcac 
actggagcca gcctaggcca gagggtggtg cgttcaggta gcaaagacag gtgggctctg 
tcccgccttc acctggagct gccctggctg ctggcggagc gtgtcgtgct gtagcgcttc 
ttcacgatca tgctgatgta ggctttcatg agcttggcca catccaccac ctcactggtt 
tcaaagagca gctccctctc atcgaccacg atcttatacg tattcgccag gggtgcccca 
aaagagagga tgtgttcata ctggaagact tccagtggtc ttccctctcc acgcttgtag 
acggagacgg cgtccgcgct gacacccaac cagagttcct gagggaagcc accttccttg 
cactggtggg caagagtcaa cagaagagtt aagtcatgaa gtggttggca acagaaagca 
tctaaaccat aagacaggct ttgagtgaag tcctctgtgc agaagattaa atatattcga 
tgtgcatgca tgcatggagg ggcctgaaat atgaaaaatg gcacctctct ggctatcttg 
atttctaact agttaatctc acgcttttgg gaaaacctca ctaactggca gagtctaaca 
tcttgctttg actctccact tctcagcatt attctactag ctgtttggat tagctacgtg 
gaagtggcct ggaaacgtac atgcttggcc gggggactta agaaagcttc cctgcaaccc 
aagccaagtc tactcttgta ttaatatctc cagttctgcc tccaatcctc tttgcggatg 
gttagtcttc aaatacaaaa tctaggatca cagaggaaaa ttctccaaat cacgcatgct 
gagcagttct ggctcctctt cacaaggagc agcaatggcc ttccatatgc agagtgggaa 
cagggacttt accagtttaa ctgtagactt tcctgtacag attggtggaa gaaaataaga 
ccccatatga aggggctaac aacacagggc tgatccaaac ctggacaagc aggagggcta 
taaattggag acgctgaaaa gagtctctag tttatatccc taataaccag acattctctg 
catcctccat gcaaaagcca gtagctttct ttttttcttt ttttttttga cggagtctca 
ctctgtcgcc caggctggag tgcagtggcg tgatcctggc tcactgcaac ctccacctcc 
tgagttcaag cgattctcct gtctcagcct ctcgagtagc tgggattaca ggtgcatgcc 
accacgccca gctaattttt tgcagagatg gggtttcacc gtgttagcca ggatggtctc 
gatctcctga cctcatgatc cgcccgcctt ggcctcccaa agcgctggga ttacaggcat 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
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i agccaccgc gcccggccaa gccagtagct t tctatgcta attcacagct cacotcccgc 
aggaagccaa gagtttaact gctattatct .ttccttgtc agggagaaat ggaattatgg 
cLgtacaa agcacctgafc ttttttatac ttaaaaacag gcataattga «c«cc» 
ccaatcaaaa acatcaccta atgaaaagcc acccacggat tctagaattt ataatattta 
gaattttata cagcctcaat ataaagtcat cagatatacg ctgaattact ^gatcataa 
aaaatggaag ctaatctaga cgatgagctg gcacacttat ctgttaaggg ctocat-ot. 
aagattttta gactttgtgg atcacatggt ctctgtcaca actactcaac ^tgtacaaa 
aacagccagg gaatatatct aaaggaatga gcttggctgt gttccaataa 
agaaaaaaag gaggcaggca agatctgacc cacagaccag tttgccaaac "tcatctag 
acaattagta agatttcttt tcaataagcg gtctacttaa aacaaaacaa aaatcagtac 
tgggtt ga t g ccaatggcta aattccatta cgagatagac .ttcttcctt t« 
ggctgtaaag aaaaaacaaa gtaaaatgca agtatatcca aagtttctaa "tgtatata 
!agcta t aac atttttttaa atgtagattt ttatcagtgt ttaaaaaatt agatctatag 
cttccctaag gaagggtaga agaatagatg acatcttaat tttgcattca "cctaatat 
tacagatgca tttactacac aggagaagag aaactgtgag gagaagggag ^taatgg 
tacaattttg ggggctcgaa aaaaagaggt tgagagagca aaatgctcca tctt 9 tc "* 
tctccacatg aacttggccg tgatccatgt tctcagatgc cagcacccag cccaccccaa 
cacatggcag ccagttctca cctccacatc aaacagcgtc gagccatagc caggcc 
cttgatcaag gccatgtact tggccatggc ctgttcctgg ttcattccct gaaatttcct 
ccacttgtca atgatactgg ctcgagcaga ggagacttct tccttaatcc -atgtcca 
catctgctcc tcctcgacct tctgccggac cacggatcct gtccggaagc tccgcctcag 
ggtcccctct aggaagctcg tccgcctctt ctccagccgt tcacaagggg ^aaggtttt 
Jgttgactgg ctgatgcggg ccttgagtct ctgcagggaa taaacctctt cgagaggtgg 
gatggcagcg tgcagagtat aatccccctg cagatactgg agtcgcaggg c-gc^c 
Lggaggttt tcttccgggg ctggatggtg gccatggata . accgcttcgt gggcctgttc 
aaacataaat gcaaactcca cactgtcttt tggcacgttg tctgtgtcca waagcagta 
aagtttgaag tagaatttcc atggcaggtc cccaacctcg gatgtggcag ccagcttttc 
aaacttggct aagacatcag ctacgacggt tcgactttca atggctttgt cgacgtggcc 
gttgtattca aacaaagcaa acatgttcct gctgtcctcc atggccaggc ctcggatcag 
cttctccacc acctccccag cggtggtgtg ggagttgatg gtgatcttgc aggagccgcc 
gccatggcaa tagaccgtgg atgtcatttc ctgcctgtgg atcagagctt ctatttcatc 
tcgggaaggc acaaactctc ggcatttggt tttcttaaga gattcgtaag tgaagagagc 
gtatttttcc atctcggttc ctggaaactg ttcccgtatc cttttcagat ggaacttgag 
atacttgaga atccctcgac tcggcaggaa ggtgcagctc aggcatgtca ggatctgcca 
gctgtacagg ttgcccacac tgccggggtg gggcactttg ttggtctgtt tgataagctg 
gcagtacagc tcgtcccgca gaggtcgcag gtcatgccct gtctgtagga tgccctggat 
ta^ggaatt gggtcagaca tggactccag ttgctgcagg gaattgaata "ttgatggc 
ctcatcctga agggtggtat agcctttgtc tttcagcaag ttgagattta ^cc 
cggaaggggc aggagcgggg agtgcaaggg gtgatgggtg tatcgaagga tcgggttccg 
cttgtaaatc tgttccacca catccgagtt caggcagttc tccttgatat -tgaatcag 
ctgctgggtg ggggtgtcga tcggggcctt ggtgtcagtc acgttttgaa tgacactgga 
ccLcgggtg gcctcgttga gcagcttggt gtagagccgg taacagtgct tgcgcccgta 
cacggtgacg ttccagtagc ctgtctcttt gaatatcttc tcatctgggg ggacgacaga 
graglggctg ttgagga ^ccc^^ 



2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 
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gtaatccagg 
ctttggactg 
cactctggtg 
cgtgtcggcg 
atcagggctg 
attctgtggg 
gacctgactc 
gaaagtccta 
ctcttttgcc 
gtacatcagc 
cgtggaggag 
gcggaaccac 
caggccacct 
agagtcacgc 
actatcttca 
gtaggtcccc 
ggtcacgctg 
gtcgccgtct 
gctggaggag 
gtccgcactg 
gtcatcgctg 
gggggagtcc 
gtagggctgg 
ggaaaattcg 
gtcgaaattg 
gcggagctcc 
ctcctgctgc 
ttccacctgc 
gctcttctgc 
gagctcggct 
ttcttcctct 
gtaaactctc 
caggtgcaaa 
acaataaagg 
caccatggcc 
ttctcgaaga 
ctgcagcagg 
cactttatac 
gattctcaca 
aaactggtct 
ggagcttagc 
aggccgccga 
ttcaaaaaga 
tcgaaatgta 



gaattgtggg 
ttcttcacct 
tcccctttgg 
ttgcagtgca 
tcagaggcac 
tttgcctgct 
agcacgctga 
tcggccataa 
gttcgcactt 
ttggactggc 
cccccccctt 
aagaaggttt 
ttcatgtaca 
cggtatgaaa 
aacgaggact 
acagagcagc 
ctgccggaag 
ggggagggca 
ttgtgtaggc 
gggctggtgg 
gtccggatgc 
ttgaaggcgt 
ctgaagttga 
cttccccccg 
agggactcga 
tggtcccgcc 
tccttcatgc 
ttattttcct 
aaggcttcga 
tctcttcgct 
tcctgtttct 
cgagcaatct 
aatctcctcc 
acctttctgt 
gcgtggctca 
aataccttgg 
ctcgtgcact 
cttttgtaaa 
gtctccagca 
ggcatcttct 
gttgccatta 
tgtttgcttc 
tcgtagataa 
tctctgttct 



tgagtacaaa 
ctttgtgcaa 
acctctgcag 
gcacccggtt 
acacagaatc 
catcatgcat 
accactggct 
tgatgtcgat 
ctacggtgcc 
ggaggacaaa 
ttttgtggag 
catccttgag 
gaaagctgtg 
gctcatcatc 
gcgccccctc 
ggtagtcggg 
tgatggcacc 
agtccccagc 
tcccggagtc 



gcaccaccgt 
cacttgttcg 
cgtcgtcggc 
agttgggctt 
acagggaccg 
ggaactcctg 
gctcctgcag 
gctgcaggtc 



tctgtttctc 
gttcttgctg 
ctctctctct 
tcttttcttc 
gacctctgag 
tcagaaggaa 
attgtttccg 
cttcctcttc 
tcttccccag 
tccctcggac 
agtcctgaaa 
tccctgagta 
gcatgtttgg 
aggaatgcag 
cacatttcaa 
agtcaaaccg 
tctccaagat 



ccaccgtttc 
ccatcctctc 
cagggttatc 
ggccgtgatg 
aatcagcccc 
ctcctggatc 
ggcatcttct 
cccattctcc 
cttgagcttc 
ccagcgcttc 
ccagccttgc 
gacgcaccag 
gaaatacggc 
tgtatcaaac 
agagctgaac 
ggaccactgg 
gtcctcatag 
gttctggggc 
ctgcactgat 
gtcgttcatg 
ctggtctgag 
ttcgaagccc 
ctcctcgcat 
ctcgatattc 
ggccgccctg 
cttctgcagg 
ctcgatttct 
cagttcacgg 
cttcctcgtt 
ttctctttct 
ttgctccctt 
ttgcttctgg 
tgctctgtaa 
tgctaagaag 
cctccgcttc 
ctgccactcg 
gtcctcaggc 
gggtcttcgg 
ccgcagctgg 
cttgatacag 
tgagtcaacc 
ggtatcctgg 
gctttctctt 
acctcggaca 



ttcagtttca 
acgatgaatt 
cagtggtgca 
atcacaaacg 
acatccaagg 
tcctggtccg 

gggg actctg 
ttggtggtgt 

tcctcgctgt 
ttccaatttc 
ttgagggcct 
cggcgtttcc 
agagtgacac 
ctggaatcaa 
cggtaggcac 
ctgccgtagg 
tcatcctggt 
atgcagtagg 
ggggcgagca 
tatgggtcct 
tggccgtgct 
tcatcgacct 
gcgctctcag 
cggacacact 
cacgcttcct 
gaagcctcgg 



ttctccagac 
gtcagttcag 
tcttcttcct 
tcttcctccc 
ttctctgcca 
aaaactatgg 
ttcttctgta 
cccaagacat 
tccagtttct 



ctgttggagg 
agagccagat 
accgcatacc 
ttcagcacaa 
cgaacaaaga 
ttgaactgtg 
ttgttgcggc 
agcaaattga 
tcatattgca 



gtgaagacat 
cctggccctc 
tctcctccgg 
agttgggtct 
tgcccacagc 
tggacgcgtg 
caatcaggtg 
tatctatgat 
cgttttcaaa 
tcctggacag 
cctgcttgga 
aagagttcat 
agctgtacac 
agtcctcttc 
ccgagctgtt 
agttggagaa 
cgtagtcgta 
tggactcgcc 
gcaccgtgct 
cctctgaaga 
cgctggggtt 
cctcctctgg 
ccagctcgct 
cgtcgatctc 
cctccagcct 
tcagcgacag 
ggaggatctc 
cttccttctg 
gctgggcgcg 
gtttcttctt 
gcaattgtct 
ctgccttttt 
ttatcaccac 
gggcccgaat 
gttccaagga 
catcatagag 
tcctcatcag 
cagctttgcg 
ccgcctggtc 
aaggattaga 
agctgactgt 
ttgaaacatg 
gaaggtcatc 



cctctccagc 



5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
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atagtgcttc 
cgcatgctga 
ttcttcattg 
attgtctatc 
taaagaaaaa 
gaactgttca 
agacttgaag 
gcagcacgca 
gagaggcgtg 
gagctgtgtt 
aacctgtgcc 
cagcctcgac 
aacttcccta 
caagtagtgg 
ccctgccagc 
tcggttctgg 
acagatgttc 
cgcattgccg 
tgtcttctcc 
gatcaattta 
gttgtcgtgg 
gggcagctcg 
ggcgatgggc 
atttctctta 
ggacgccatg 
ggtaattgtg 
gccttctgca 
ccgtgttccc 



actccaaaat 
ctgtgtagct 
ataagggcta 
cagtcaatat 
atatgcttgt 
aagtgattaa 
tcctcattgc 
tacagagcca 
aggatctctt 
gggtccagcc 
ccaccagcag 
acttcccgaa 
aaggattcct 
tagttttctg 
agtgcataaa 
gaagagagga 
agctgaacaa 
aaagcttcca 
tttaaggaca 
gtgctttcgg 
cgcttccaca 
cccaggtggc 
tggtaggggt 
taccgctgga 
tcatccacgc 
ctctgcttgt 
caggaattta 
tgtaaacaaa 



tgttaactgc 
tctccaataa 
ggaggccaag 
cttcccacac 
tgaagtactc 
cctcaaagtt 
ctttgatcct 
tggccaggga 
ctcccctgag 
caagtaactc 
tgataaattc 
cttcctcctt 
ggtcactgat 
gcgtagataa 
atatgtgata 
tacaatctac 
acttcccaaa 
tgatggggct 
attccaaaga 
ttttacctgc 
ggcagcggta 
gccggctgta 
tcacagaggc 
ataagttata 
cctcctcgtt 
aagtgaatac 
cagtacttgg 



aactctgggc 
ggtgctgtct 
tttcttctca 
taatccttcc 
ctgaagtttc 
ttcaaatcca 
gctgttgatc 
gtccctgctg 
gaacattgat 
cgcagatctg 
tatgttccca 
gctgaactgc 
tgtcttgtct 
ataaaattct 
attcctttcc 
aattctcccg 
gcgactagag 
gctttcaaga 
ctgttgactg 
cccactttca 
gcactcgttg 
ctgctccatg 
caggatggag 
catgatggag 
cgtggggtgc 
ctgaccatag 
aaaatgctgg 



ttcacataaa 
gtggcttgag 
atcaagtcca 
ctgctatatt 
tcgtttgcat 
aagatgtcga 
ttcttgatta 
tctactgcct 
ctctgggtca 
cccaaagctg 
agatgcagta 
atcacgtcca 
tctacacatc 
tctctttctt 
ccgggatttt 
ccctgaatat 
ttgttgttgt 
atagctcgtt 
atgactgaca 
cccttgatga 
gcgatggcga 
gtggcaggct 
ccgatgtagg 
ccgccatgga 
atagcagtca 
tctgtccgga 
ccattttctc 



agtggttatt 
gaaaatggct 
ggcattctcc 
ctagttgttc 
agtttatatt 
ggatgccaat 
cccactcaaa 
gttgaacatt 
aagcatctgt 
ttttgaagga 
taccagcaag 
ttgccgtaat 
cagactgatt 
catgttccag 
gccttactac 
ttcctttctg 
acacggtctt 
caacacagga 
gaaacttgag 
ggatgcactg 
agatgtgcgg 
cgtacagccc 
tccatatttg 
gctctgtcaa 
ccttctggtg 
agacgacgat 
tcagccagac 



8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9621 



Table 



12 hMX2 polypeptide sequence (SEQ ID NO: 12) 

v^T^r^^ Hi£ 



10 



Phe Cys Leu Gin Gly 

. la rlll r i v Tie Val Val Phe Arg 
Phe Pro Ser Thr Val Asn Ser Cys Ala Glu Gly V ^ 

20 ZD 

n v»i Phe Thr Tyr Lys Gin Ser Thr He Thr His 

Thr Asp Tyr Gly Gin Val Phe Thr ryr uy ^ 



35 



G1 „ Lys V.! TK, >1, His P ro T „r » Ola Glu «y «1 *P «P 

50 " 

rlv Ser He Met Tyr Asn Leu 
Met Ala Ser Leu Thr Glu Leu His Gly Gly Ser 8Q 



65 



Phe 



Leu 



70 



G1 „ «. Tyr «, « <a» II- «P ™ *** »' Gly %f 
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100 








105 










110 






Ala 


Thr 


Met 
115 


Glu 


Gin 


Tyr 


Ser 


Arg 
120 


Arg 


His 


Leu 


Gly 


Glu 
125 


Leu 


Pro 


Pro 


His 


He 
130 


Phe 


Ala 


He 


Ala 


Asn 
135 


Glu 


Cys 


Tyr 


Arg 


Cys 
140 


Leu 


Trp 


Lys 


Arg 


His 
145 


Asp 


Asn 


Gin 


Cys 


He 
150 


Leu 


He 


Lys 


Gly 


Glu 
155 


Ser 


Gly 


Ala 


Gly 


Lys 
160 


Thr 


Glu 


Ser 


Thr 


Lys 
165 


Leu 


He 


Leu 


Lys 


Phe 
170 


Leu 


Ser 


Val 


He 


Ser 
175 


Gin 


Gin 


Ser 


Leu 


Glu 
180 


Leu 


Ser 


Leu 


Lys 


Glu 
185 


Lys 


Thr 


Ser 


Cys 


Val 
190 


Glu 


Arg 


Ala 


He 


Leu 
195 


Glu 


Ser 


Ser 


Pro 


He 
200 


Met 


Glu 


Ala 


Phe 


Gly 
205 


Asn 


Ala 


Lys 


Thr 


Val 
210 


Tyr 


Asn 


Asn 


Asn 


Ser 
215 


Ser 


Arg 


Phe 


Gly 


Lys 
220 


Phe 


Val 


Gin 


Leu 


Asn 
225 


He 


Cys 


Gin 


Lys 


Gly 
230 


Asn 


He 


Gin 


Gly 


Gly 
235 


Arg 


He 


Val 


Asp 


Cys 
240 


He 


Leu 


Ser 


Ser 


Gin 
245 


Asn 


Arg 


Val 


Val 


Arg 
250 


Gin 


Asn 


Pro 


Gly 


Glu 
255 


Arg 


Asn 


Tyr 


His 


He 
260 


Phe 


Tyr 


Ala 


Leu 


Leu 
265 


Ala 


Gly 


Leu 


Glu 


His 
270 


Glu 


Glu 


Arg 


Glu 


Glu 
275 


Phe 


Tyr 


Leu 


Ser 


Thr 
280 


Pro 


Glu 


Asn 


Tyr 


His 
285 


Tyr 


Leu 


Asn 


Gin 


Ser 
290 


Gly 


Cys 


Val 


Glu 


Asp 
295 


Lys 


Thr 


He 


Ser 


Asp 
300 


Gin 


Glu 


Ser 


Phe 


Arg 
305 


Glu 


Val 


He 


Thr 


Ala 
310 


Met 


Asp 


Val 


Met 


Gin 
315 


Phe 


Ser 


Lys 


Glu 


Glu 
320 


Val 


Arg 


Glu 


Val 


Ser 
325 


Arg 


Leu 


Leu 


Ala 


Gly 
330 


He 


Leu 


His 


Leu 


Gly Asn 
335 


He 


Glu 


Phe 


He 
340 


Thr 


Ala 


Gly 


Gly 


Ala 
345 


Gin 


Val 


Ser 


Phe 


Lys 
350 


Thr 


Ala 


Leu 


Gly 


Arg 
355 


Ser 


Ala 


Glu 


Leu 


Leu 
360 


Gly 


Leu 


Asp 


Pro 


Thr 
365 


Gin 


Leu 


Thr 


Asp 


Ala 
370 


Leu 


Thr 


Gin 


Arg 


Ser 
375 


Met 


Phe 


Leu 


Arg 


Gly 
380 


Glu 


Glu 


He 


Leu • 


Thr 
385 


Pro 


Leu 


Asn 


Val 


Gin 
390 


Gin 


Ala 


Val 


Asp 


Ser 
395 


Arg 


Asp 


Ser 


Leu 


Ala 
400 


Met 


Ala 


Leu 


Tyr 


Ala 
405 


Cys 


Cys 


Phe 


Glu 


Trp 
410 


Val 


He 


Lys 


Lys 


He 
415 


Asn 


Ser 


Arg 


He 


Lys 
420 


Gly 


Asn 


Glu 


Asp 


Phe 
425 


Lys 


Ser 


He 


Gly 


He 
430 


Leu 


Asp 


He 


Phe 


Gly 
435 


Phe 


Glu 


Asn 


Phe 


Glu 
440 


Val 


Asn 


His 


Phe 


Glu 
445 


Gin 


Phe 


Asn 


He 


Asn 


Tyr 


Ala 


Asn 


Glu 


Lys 


Leu 


Gin 


Glu 


Tyr 


Phe 


Asn 


Lys 


His 


He 



# 

re- 



iser 455 



460 



Phe Ser Leu Glu Gin Leu Glu Tyr Ser Arg Glu Gly Leu Val Trp Glu 
465 470 475 

Asp lie Asp Trp lie Asp Asn Gly Glu Cys Leu Asp Leu He Glu Lys 



485 



Lys Leu Gly Leu Leu Ala Leu He Asn Glu Glu Ser His Phe Pro Gin 
500 505 

Ala Thr Asp Ser Thr Leu Leu Glu Lys Leu His Ser Gin His Ala Asn 

515 520 
Asn His Phe Tyr Val Lys Pro Arg Val Ala Val Asn Asn Phe Gly Val 

530 535 S 

Lys His Tyr Ala Gly Glu Val Gin Tyr Asp Val Arg Gly He Leu Glu 
545 550 

Lys Asn Arg Asp Thr Phe Arg Asp Asp Leu Leu Asn Leu Leu Arg Glu 



565 



Ser Arg Phe Asp Phe He Tyr Asp Leu Phe Glu His Val Ser Ser Arg 
580 585 

Asn Asn Gin Asp Thr Leu Lys Cys Gly Ser Lys His Arg Arg Pro Thr 
595 600 605 

Val Ser Ser Gin Phe Lys Val Asp Ser Leu His Ser Leu Met Ala Thr 
610 615 620 

Leu Ser Ser Ser Asn Pro Phe Phe Val Arg Cys He Lys Pro Asn Met 
625 630 635 

Gin Lys Met Pro Asp Gin Phe Asp Gin Ala Val Val Leu Asn Gin Leu 

t- a r- 650 

He Arg Lys 

660 " " " 665 670 



645 

Arg Tyr Ser Gly Met Leu Glu Thr Val Arg He Arg Lys Ala Gly Tyr 



Ala Val Arg Arg Pro Phe Gin Asp Phe Tyr Lys Arg Tyr Lys Val Leu 
675 680 685 

Met Arg Asn Leu Ala Leu Pro Glu Asp Val Arg Gly Lys Cys Thr Ser 
690 695 ' uu 

Leu Leu Gin Leu Tyr Asp Ala Ser Asn Ser Glu Trp Gin Leu Gly Lys 
705 710 

Thr Lys Val Phe Leu Arg Glu Ser Leu Glu Gin Lys Leu Glu Lys Arg 
725 730 

Arg Glu Glu Glu Val Ser His Ala Ala Met Val He Arg Ala His Val 

Leu Gly Phe Leu Ala Arg Lys Gin Tyr Arg Lys Val Leu Tyr Cys Val 
755 7g 0 765 

Val He He Gin Lys Asn Tyr Arg Ala Phe Leu Leu Arg Arg Arg Phe 

770 775 780 

Leu His Leu Lys Lys Ala Ala He Val Phe Gin Lys Gin Leu Arg Gly 
785 7 90 795 

Gin He Ala Ara Arg Val Tvr Arc Gin Leu Leu Ala Glu Lys Arg Glu 
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805 



810 



815 



Gin- Glu Glu Lys Lys Lys Gin Glu Glu Glu Glu Lys Lys Lys Arg Glu 
820 825 830 

Glu Glu Glu Arg Glu Arg Glu Arg Glu Arg Arg Glu Ala Glu Leu Arq 
835 840 845 

Ala Gin Gin Glu Glu Glu Thr Arg Lys Gin Gin Glu Leu Glu Ala Leu 
850 855 860 

Gin Lys Ser Gin Lys Glu Ala Glu Leu Thr Arg Glu Leu. Glu Lys Gin 
865 870 875 Y 880 

Lys Glu Asn Lys Gin Val Glu Glu He Leu Arg Leu Glu Lys Glu He 
885 890 895 

Glu Asp Leu Gin Arg Met Lys Glu Gin Gin Glu Leu Ser Leu Thr Glu 
900 905 910 

Ala Ser Leu Gin Lys Leu Gin Glu Arg Arg Asp Gin Glu Leu Arq Ara 
915 920 925 

Leu Glu Glu Glu Ala Cys Arg Ala Ala Gin Glu Phe Leu Glu Ser Leu 
9 30 935 940 

Asn Phe Asp Glu He Asp Glu Cys Val Arg Asn He Glu Arg Ser Leu 
945 950 955 960 

Ser Gly Gly Ser Glu Phe Ser Ser Glu Leu Ala Glu Ser Ala Cys Glu 
965 970 975 

Glu Lys Pro Asn Phe Asn Phe Ser Gin Pro Tyr Pro Glu Glu Glu Val 
980 985 990 

Asp Glu Gly Phe Glu Ala Asp Asp Asp Ala Phe Lys Asp Ser Pro Asn 
995 1000 1005 

Pr ° Glu His Gly His Ser As P Gln Ar S Th *" Ser Gly He Arg 

1010 1015 1020 

Thr Asp Asp Ser Ser Glu Glu As P Pro TYr- Met Asn Asp Thr 

1025 1030 1035 

Val YnL Pr ° Thr Ser Pr ° Ser Ala As P Ser Thr Val Leu Leu Ala 
1040 1045 1050 

Pro Ser Val Gin Asp Ser Gly Ser Leu His Asn Ser Ser Ser Gly 
1055 1060 1065 

G1U ?n£n Thr ^ CyS Met Pro Gln Asn Ala G ly A SP Leu Pro Ser 
1070 1075 1080 

Pr ° ^ Gly Asp ^ Asp ^ As P Gln As P A sp Tyr Glu Asp Gly 
1085 1090 1095 

Ala ?^ Thr Ser Gly Ser Ser Val Thr Phe Ser Asn Ser Tyr Gly 
1100 H05 mo 

Ser Gln Trp Ser Pro Asp Tyr Arg Cys Ser Val Gly Thr Tyr Asn 
H15 H20 H25 

Ser Gly Ala ^ Arg Phe Ser Ser Glu Gi y A la Gln Ser Ser 

H30 H35 1140 

phe Glu — Asp Ser G lu Glu Asp Phe Asp Ser Arg Phe Asp Thr Asp 
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1145 










1150 










1155 








Asp 


Glu 
1160 


Leu 


Ser 


Tyr 


Arg 


Arg 
1165 


Asp 


Ser 


Val 


Tyr 


Ser 
1170 


Cys 


Val 


Thr 


Leu 


Pro 
1175 


Tyr 


Phe 


His 


Ser 


Phe 
1180 


Leu 


Tyr 


Met 


Lys 


Gly 
1185 


Gly 


Leu 


Met 


Asn 


Ser 
1190 


Trp 


Lys 


Arg 


Arg 


Trp 
1195 


Cys 


Val 


Leu 


Lys 


Asp 
1200 


Glu 


Thr 


Phe 


Leu 


Trp 
1205 


Phe 


Arg 


Ser 


Lys 


Gin 
1210 


Glu 


Ala 


Leu 


Lys 


Gin 
1215 


Gly 


Trp 


Leu 


His 


Lys 
1220 


Lys 


Gly 


Gly 


Gly 


Ser 
1225 


Ser 


Thr 


Leu 


Ser 


Arg 
1230 


Arg 


Asn 


Trp 


Lys 


Lys 
1235 


Arg 


Trp 


Phe 


Val 


Leu 
1240 


Arg 


Gin 


Ser 


Lys 


Leu 
1245 


Met 


Tyr 


Phe 


Glu 


Asn 
1250 


Asp 


Ser 


Glu 


Glu 


Lys 
1255 


Leu 


Lys 


Gly 


Thr 


Val 
1260 


Glu 


Val 


Arg 


Thr 


Ala 
1265 


Lys 


Glu 


He 


He 


Asp 
1270 


Asn 


Thr 


Thr 


Lys 


Glu 
1275 


Asn 


Gly 


He 


Asp 


He 
1280 


He 


Met 


Ala 


Asp 


Arg 
1285 


Thr 


Phe 


His 


Leu 


He 
1290 


Ala 


Glu 


Ser 


Pro 


Glu 
1295 


Asp 


Ala 


Ser 


Gin 


Trp 
1300 


Phe 


Ser 


Val 


Leu 


Ser 
1305 


Gin 


Val 


His 


Ala 


Ser 
1310 


Thr 


Asp 


Gin 


Glu 


He 
1315 


Gin 


Glu 


Met 


His 


Asp 
1320 


Glu 


Gin 


Ala 


Asn 


Pro 
1325 


Gin 


Asn 


Ala 


Val 


Gly 
1330 


Thr 


Leu 


Asp 


Val 


Gly 
1335 


Leu 


He 


7?sp 


Ser 


Val 
1340 


Cys 


Ala 


Ser 


Asp 


Ser 
1345 


Pro 


Asp 


Arg 


Pro 


Asn 
1350 


Ser 


Phe 


Val 


He 


He 
1355 


Thr 


Ala 


Asn 


Arg 


Val 
1360 


Leu 


His 


Cys 


Asn 


Ala 
1365 


Asp 


Thr 


Pro 


Glu 


Glu 
1370 


Met 


His 


His 


Trp 


He 
1375 


Thr 


Leu 


Leu 


Gin 


Arg 
1380 


Ser 


Lys 


Gly 


Asp 


Thr 
1385 


Arg 


Val 


Glu 


Gly 


Gin 
1390 


Glu 


Phe 


He 


Val 


Arg 
1395 


Gly 


Trp 


Leu 


His 


Lys 
1400 


Glu 


Val 


Lys 


Asn 


Ser 
1405 


Pro 


Lys 


Met 


Ser 


Ser 
1410 


Leu 


Lys 


Leu 


Lys 


Lys 
1415 


Arg 


Trp 


Phe 


Val 


Leu 
1420 


Thr 


His 


Asn 


Ser 


Leu 
1425 


Asp 


Tyr 


Tyr 


Lys 


Ser 
1430 


Ser 


Glu 


Lys 


Asn 


Ala 
1435 


Leu 


Lys 


Leu 


Gly 


Thr 
1440 


Leu 


Val 


Leu 


Asn 


Ser 
1445 


Leu 


Cys 


Ser 


Val 


Val 
1450 


Pro 


Pro 


Asp 


Glu 


Lys 

1455 


He 


Phe 


Lys 


Glu 


Thr 
1460 


Gly 


Tyr 


Trp 


Asn 


Val 
1465 


Thr 


Val 


Tyr 


Gly 


Arg 
1470 


Lys 


His 


Cys 


Tyr 


Arg 


Leu 


Tyr 


Thr 


Lys 


Leu 


Leu 


Asn 


Glu 


Ala 


Thr 


Arg 


Trp 


Ser 
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1475 148(f 1485 

Ser Val He Gin Asn Val Thr Asp Thr Lys Ala Pro He Asp Thr 
1490 • 1495 1500 

Pro Thr Gin Gin Leu He Gin Asp He Lys Glu Asn Cys Leu Asn 
1505 1510 15 1 5 

Ser Asp Val Val Glu Gin He Tyr Lys Arg Asn Pro He Leu Arg 
1520 1525 1530 

Tyr Thr His His Pro Leu His Ser Pro Leu Leu Pro Leu Pro Tyr 
y 1535 1540 1545 

Gly Asp He Asn Leu Asn Leu Leu Lys Asp Lys Gly Tyr Thr Thr 
1550 1555 1560 

Leu Gin Asp Glu Ala He Lys He Phe Asn Ser Leu Gin Gin Leu 
1565 1570 1575 

Glu Ser Met Ser Asp Pro He Pro He He Gin Gly He Leu Gin 
1580 1585 1590 

Thr Gly His Asp Leu Arg Pro Leu Arg Asp Glu Leu Tyr Cys Gin 
15 95 1600 1605 

Leu He Lys Gin Thr Asn Lys Val Pro His Pro Gly Ser Val Gly 
1610 1615 1620 

Asn Leu Tyr Ser Trp Gin lie Leu Thr Cys Leu Ser Cys Thr Phe 
1625 1630 1635 

Leu Pro Ser Arg Gly He Leu Lys Tyr Leu Lys Phe His Leu Lys 
1640 1645 1650 

Arg He Arg Glu Gin Phe Pro Gly Thr Glu Met Glu Lys Tyr Ala 
1655 1660 1665 

Leu Phe Thr Tyr Glu Ser Leu Lys Lys Thr Lys Cys Arg Glu Phe 
1670 1675 1680 

Val Pro Ser Arg Asp Glu He Glu Ala Leu He His Arg Gin Glu 
1685 1690 1695 

Met Thr Ser Thr Val Tyr Cys His Gly Gly Gly Ser Cys Lys lie 
1700 1705 1710 

Thr He Asn Ser His Thr Thr Ala Gly Glu Val Val Glu Lys Leu 
1715 1720 1725 

He Arg Gly Leu Ala Met Glu Asp Ser Arg Asn Met Phe Ala Leu 
1730 1735 1740 

Phe Glu Tyr Asn Gly His Val Asp Lys Ala He Glu Ser Arg Thr 
1745 1750 1755 

Val Val Ala Asp Val Leu Ala Lys Phe Glu Lys Leu Ala Ala Thr 
1760 1765 1770 

Ser Glu Val Gly Asp Leu Pro Trp Lys Phe Tyr Phe Lys Leu Tyr 
177 5 1780 1785 

Cys Phe Leu Asp Thr Asp Asn Val Pro Lys Asp Ser Val Glu Phe 
17 90 1795 I 800 

Ala Phe Met Phe Glu Gin Ala His Glu Ala Val He His Gly His 
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1805 










1810 










1815 








His 


Pro 
1820 


Ala 


Pro 


Glu 


Glu 


Asn 
1825 


Leu 


Gin 


Val 


Leu 


Ala 
1830 


Ala 


Leu 


Arg 


Leu 


Gin 
1835 


Tyr 


Leu 


Gin 


Gly 


Asp 
1840 


Tyr 


Thr 


Leu 


His 


Ala 
1845 


Ala 


lie 


Pro 


Pro 


Leu 
1850 


Glu 


Glu 


Val 


Tyr 


Ser 
1855 


Leu 


Gin 


Arg 


Leu 


Lys 
1860 


Ala 


Arg 


He 


Ser 


Gin 
1865 


Ser 


Thr 


Lys 


Thr 


Phe 
1870 


Thr 


Pro 


Cys 


Glu 


Arg 
1875 


Leu 


Glu 


Lys 


Arg 


Arg 
1880 


Thr 


Ser 


Phe 


Leu 


Glu 
1885 


Gly 


Thr 


Leu 


Arg 


Arg 
1890 


Ser 


Phe 


Arg 


Thr 


Gly 
1895 


Ser 


Val 


Val 


Arg 


Gin 
1900 


Lys 


Val 


Glu 


Glu 


Glu 
1905 


Gin 


Met 


Leu 


Asp 


Met 
1910 


Trp 


lie 


Lys 


Glu 


Glu 
1915 


Val 


Ser 


Ser 


Ala 


Arg 
1920 


Ala 


Ser 


He 


He 


Asp 
1925 


Lys 


Trp 


Arg 


Lys 


Phe 
1930 


Gin 


Gly 


Met 


Asn 


Gin 
1935 


Glu 


Gin 


Ala 


Met 


Ala 
1940 


Lys 


Tyr 


Met 


Ala 


Leu 
1945 


He 


Lys 


Glu 


Trp 


Pro 
1950 


Gly 


Tyr 


Gly 


Ser 


Thr 
1955 


Leu 


Phe 


Asp 


Val 


Glu 
1960 


Val 


Arg 


Thr 


Gly 


Cys 
1965 


His 


Val 


Leu 


Gly 


Trp 
1970 


Ala 


Gly 


Cys 


Trp 


His 
1975 


Leu 


Arg 


Thr 


Trp 


He 
1980 


Thr 


Ala 


Lys 


Phe 


Met 
1985 


Trp 


Arg 


Glu 


Asp 


Lys 
1990 


Met 


Glu 


His 


Phe 


Ala 
1995 


Leu 


Ser 


Thr 


Ser 


Phe 
2000 


Phe 


Arg 


Ala 


Pro 


Lys 
2005 


He 


Val 


Pro 


Leu 


Thr 
2010 


Pro 


Pro 


Phe 


Ser 


Ser 
2015 


Gin 


Phe 


Leu 


Phe 


Ser 
2020 


Cys 


Val 


Val 


Asn 


Ala 
2025 


Ser 


Val 


He 


Leu 


Gly 
2030 


Met 


Asn 


Ala 


Lys 


Leu 
2035 


Arg 


Cys 


His 


Leu 


Phe 
2040 


Phe 


Tyr 


Pro 


Ser 


Leu 
2045 


Gly 


Lys 


Leu 























Table 13 hMP nucleotide sequence (SEQ ID NO: 13) 



ccaacttttg 


cagctccacc 


caggatgtgg 


cctcgctcca 


ccccagctgt 


gcgcctctct 


60 


ccacccctag 


gcgaaggcac 


tagaatttcc 


caaattaaga 


acgaagagga 


agtttggacc 


120 


ttttcggcca 


ccgctcgctt 


caatatggct 


gcccccaggg 


agagacgagg 


ctaccatgaa 


180 


ggagccgagc 


gcagaccctg 


agtccgtcac 


ccatgqatcq 


cagcgcggag 


ttcaggaaat 


240 


ggaaggcgca 


atgtttgagc 


aaagcggacc 


tcagccggaa 


gggcagtgtt 


gacgaggatg 


300 


tggtagagct 


tgtgcagttt 


ctgaacatgc 


gagatcagtt 


tttcaccacc 


agctccttcg 


360 


ctggccgcat 


cctactcctt 


gaccggggta 


taaatggttt 


tgaggttcag 


aaacaaaact 


420 


gttgctggct 


actggttaca 


cacaaacttt 


gtgtaaaaga 


tgatgtgatt 


gtagctctga 


480 



-45- 



agaaagcaaa 

gacaattgca 

ctggcataac 

tagaagttcc 

tgttaaatgt 

actgcctaca 

agaaaaataa 

agtgtattac 

tcaatgttac 

atgtttctag 

atcagccatt 

cagaggatgt 

cctgaaatgc 

taaatcaaag 



tggtgatgcc 

ggatgcacag 

ggtgggaaag 

attaagccat 

ggcaaatcaa 

gcatgctttg 

ctcatcatat 

taaagaaagt 

catcttccct 

taggttttat 

cataagccag 

gcagttgtca 

tatagttatc 

atgtttgaaa 



actttgaaat 
attctgcatt 
agaggaaaaa 
aagggaaaac 
aaaatggagg 
gaaagggaaa 
attcataaga 
gatgaagaac 
gaagattact 
aaagctgctc 
taatgacaag 
tctaagctct 
cctacttttt 
aaaaaaaa 



ttgaaccatt 

ccatggcaat 

ctatgttggc 

tgatggtgac 

aaaacaagaa 

cgatgactaa 

aaaaaagaaa 

ttgaaaatga 

aagctttggt 

ttcataagag 

tgcagagctt 

cagcagtacc 

taccagtttc 



tgttcttcat 

agattctggt 

tgtccggagt 

agaggaatat 

aagaattgag 

cttacatccc 

cccagaaaaa 

tgatgatgat 

tctgatgtgt 

tattttagtt 

caaactataa 

cggcttatcc 

tcccagaagc 



gtgcagtgtc 

ttcaggaact 

acacatggct 

attgacttcc 

aggttttaca 

aagatcaaag 

acacgtgccc 

gatctaggaa 

cttggccgta 

tgttgagtgt 

ctttgttgcc 

tacgacttca 

acctgcttaa 



Table 14 hMP polypeptide sequence (SEQ ID NO: 14) 



Met Asp Arg Ser Ala Glu Phe Ar g Lys Trp Lys Ala Gin Cys Leu 

L ys Ala Asp Leu Ser Arg Lys Gly Ser Val Asp Glu Asp Val Val Glu 

20 z ^ 
Leu Val Gin Phe Leu Asn Met Arg Asp Gin Phe Phe Thr Thr Ser Ser 

35 40 
Phe Ala Gly Arg He Leu Leu Leu Asp Arg Gly lie Asn Gly Phe Glu 

50 55 
Val Gin Lys Gin Asn Cys Cys Trp Leu Leu Val Thr His Lys Leu Cys 
65 70 75 

val Lys Asp Asp Val He Val Ala Leu Lys Lys Ala Asn Gly Asp Ala 

85 yu 

Thr Leu Lys Phe Glu Pro Phe Val Leu His Val Gin Cys Arg Gin Leu 

100 105 

Gin Asp Ala Gin He Leu His Ser Met Ala He Asp Ser Gly Phe Arg 

115 120 

Asn Ser Gly He Thr Val Gly Lys Arg Gly Lys Thr Met Leu Ala Val 

130 135 
Arg Ser Thr His Gly Leu Glu , Val Pro Leu Ser His Lys Gly Lys Leu 

Met Val Thr Glu Glu Tyr He Asp Phe Leu Leu Asn Val Ala Asn Gin 

165 1/0 

Lys Met Glu Glu Asn Lys Lys Arg He Glu Arg Phe Tyr Asn Cys Leu 

180 iyb 
Gin His Ala Leu Glu Arg Glu Thr Met Thr Asn Leu His Pro Lys lie 

195 200 
t. yr Glu Lvs A~n A»n Ser Se r Tyr lie H is l .ys T. ys T.ys Arg Asn Pro 



540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1288 
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210 



215 



220 



Glu Lys Thr Arg Ala Gin Cys He Thr Lys Glu Ser Asp Glu Glu Leu 

225 230 

Glu Asn Asp Asp Asp Asp Asp Leu Gly lie Asn Val Thr He Phe Pro 

Glu Asp Tyr 



Table 15 NHR nucleotide sequence (SEQ ID NO: 15) 



acgcgtgcag 
tcttccgagt 
actccctcaa 
tgagggcaac 
gaaatgagaa 
agatcagcaa 
aaggtgcaag 
agtaatccca 
caaaagttcc 
acagagattc 
aacctggaaa 
ctcattttaa 
ttgaagaaca 
ggactccagt 
gctcgcaagg 
tcaagcgctc 
ataatgagca 
tgtctggggg 
cggggaattc 
cagtctccaa 
atgaaccaca 
atcaacatgt 
aaaggaagag 
ggaatgcgaa 
ttcctgtatt 
ttagtcacga 
catgtgctat 
ttttacttaa 
ctagtttttt 
cagttccctc 
ttacctgatg 
gtaagctggg 
acggggccat 
tatgtacctt 



gtggcgtggc 
atcgccggga 
gtacagtgac 
caagttgtta 
tcaggatgaa 
ccaggaagag 
actgtccgtg 
ctgaattcca 
ttctccacca 
aaaggtacct 
aaataaaaga 
ggaaatggag 
caattccatg 
acctccaaga 
ccggtcttgt 
tgctatctct 
taagcgttca 
cacccaaaaa 
tgctgctgtt 
taagaaacca 
caaaggaaag 
caacaaatta 
caacggaaga 
ggggcctcat 



ctcaactttt 
gatctttttc 
gatctctgaa 
gtccattaac 
aatgtcacca 
ttatcgtact 
gcagctagtt 
atagaaaggc 
atagtttggt 
cgttcaaata 



gccagggatt 
tttcgaatcg 
ctgcagaact 
aaagccttga 
agtcaaactt 
ctgagagaca 
tggaccctga 
gaatcatgaa 
gacgagcacc 
tcagaaggaa 
actgcaatca 
tccattgatc 
aatgaactga 
ggaagactct 
ggccctgcaa 
gcagctaaaa 
ctgaccaaga 
ggcgaggctg 
attaccccat 
gtgtttgatc 
ctaaaaccat 
acttctacaa 
aacgcgagca 
tttggctgaa 
ttccttttgt 
tgctaactgt 
aagacgttat 
aattcaggtt 
tcccaaactc 
aatattaaca 
atgcaagctt 
cacctcttca 
tatgacatca 
tcctcatgta 



tgaaccgcgc 
ca atg atcat 
tagccaagag 
aaggctacat 
ctgcatcctc 
gccacttggc 
ctcacagaga 
aagcaggaaa 
aagaagctga 
agaaatctct 
ctactccaaa 
caatatatng 
agcagccgcc 
ctgtggcttc 
gtcagagtac 
cgggtgtcag 
ctccagccag 
tgcttgggac 
tcaagttgac 
ttaaagcaag 

gggggcaatc 

gaaaacttac 
agaagaaagg 
gattaataat 
aaattttttt 
tcatagtctg 
caccttaaag 
tctaacgaga 
ccgtttctgg 
gaactgcagt 
caggagaatt 
ctctctatag 
atattttacc 
attgccatct 



tgacgaagtt 
cccctctcta 
tctgggtctc 
taaacatgag 
ttgtgatgag 
catgtcacca 
atcattcaga 
gccaggatct 
gaatgctgtt 
ctacacagat 
ctttaagaag 
aggagaaaaa 
catcaataag 
tactcccatc 
cttgggtctg 
gttttcagct 
aaagtctgca 
acacaaatta 
aactgaggca 
tttgtctcgt 
taaagaaaat 
aaacaacccc 
agaagaaagc 
tttttaacat 
tttttgctgt 
tgtagtgtcc 
ctcaaattct 
cccatcctaa 
atttttaatc 
cttctgctag 
tgaacaataa 
aatatagtaa 
taggtgaaat 
gtcactcact 



tggtgatcca 
gaggagctgg 
cgggccaacc 
gcaagaaaag 
actgagatac 
aaacaaggag 
gataaaaata 
cagagcactg 
tcctcaggta 
gagtcatcca 
cttcatgaag 
aagaaacatt 
ggaggggtca 
agccaacgac 
aaggggtcac 
gctactaaag 
catgtgaccg 
aagaccatca 
acgcagactc 
cccctcaact 
aattatctaa 
atctccagac 
aaaggttttg 
cttgtaaata 
catccccact 
atgggttctt 
ttgggatggt 
aattctcttt 
cccagctccc 
ccaatagcat 
caagaatagg ■ 
cctttatgaa 
tgtttaggct 
atattcacaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
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aaataaaact 


ctacaactca 


ttctaacatt 


gcttacttaa 


aagctacata 


gccctatcga 


2100 


aatgcgagga 


ttaatgcttt 


aatgctttta 


gagacagggt 


ctcactgtgt 


tgcccaggct 


2160 


ggtctcaaac 


tccaccaaat 


gtacttctta 


ttcattttat 


ggaaaagact 


aggctttgct 


2220 


tagtatcatg 


tccatgtttc 


cttcacctca 


gtggagcttc 


tgagttttat 


actgctcaag 


2280 


atcgtcataa 


ataaaatttt 


ttctcattgt 


caaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


2340 


aaaaaaaaaa 


aa 










2352 



Table 16 NHR polypeptide sequence (SEQ ID NO: 16) 



Met 
1 


He 


He 


Pro 


Ser 
5 


Leu 


Glu 


Glu 


Leu 


Asp 
10 


Ser 


Leu 


Lys 


Tyr 


Ser 
15 


Asp 


Leu 


Gin 


Asn 


Leu 
20 


Ala 


Lys 


Ser 


Leu 


Gly 
25 


Leu 


Arg 


Ala 


Asn 


Leu 
30 


Arg 


Ala 


Thr 


Lys 


Leu 
35 


Leu 


Lys 


Ala 


Leu 


Lys 
40 


Gly 


Tyr 


He 


Lys 


His 
45 


Glu 


Ala 


Arg 


Lys 


Gly Asn 
50 


Glu 


Asn 


Gin 


Asp 
55 


Glu 


Ser 


Gin 


Thr 


Ser 
60 


Ala 


Ser 


Ser 


Cys 


Asp 
65 


Glu 


Thr 


Glu 


He 


Gin 
70 


He 


Ser 


Asn 


Gin 


Glu 
75 


Glu 


Ala 


Glu 


Arg 


Gin 
80 


Pro 


Leu 


Gly 


His 


Val 
85 


Thr 


Lys 


Thr 


Arg 


Arg 
90 


Arg 


Cys 


Lys 


Thr 


Val 
95 


Arg 


Val 


Asp 


Pro 


Asp 
100 


Ser 


Gin 


Gin 


Asn 


His 
105 


Ser 


Glu 


He 


Lys 


He 
110 


Ser 


Asn 


Pro 


Thr 


Glu 
115 


Phe 


Gin 


Asn 


His 


Glu 
120 


Lys 


Gin 


Glu 


Ser 


Gin 
125 


Asp 


Leu 


Arg 


Ala 


Thr 
130 


Ala 


Lys 


Val 


Pro 


Ser 
135 


Pro 


Pro 


Asp 


Glu 


His 
140 


Gin 


Glu 


Ala 


Glu 


Asn 


Ala 


Val 


Ser 


Ser 


Gly 
150 


Asn 


Arg 


Asp 


Ser 


Lys 
155 


Val 


Pro 


Ser 


Glu 


Gly 
160 


Lys 


Lys 


Ser 


Leu 


Tyr 
165 


Thr 


Asp 


Glu 


Ser 


Ser 
170 


Lys 


Pro 


Gly 


Lys 


Asn 
175 


Lys 


Arg 


Thr 


Ala 


He 
180 


Thr 


Thr 


Pro 


Asn 


Phe 
185 


Lys 


Lys 


Leu 


His 


Glu 
190 


Ala 


His 


Phe 


Lys 


Glu 
195 


Met 


Glu 


Ser 


He 


Asp 
200 


Pro 


He 


Tyr 


Xaa 


Gly 
205 


Glu 


Lys 


Lys 


Lys 


His 
210 


Phe 


Glu 


Glu 


His 


Asn 
215 


Ser 


Met 


Asn 


Glu 


Leu 
220 


Lys 


Gin 


Pro 


Pro 


lie 
225 


Asn 


Lys 


Gly 


Gly Val 
230 


Arg 


Thr 


Pro 


Val 


Pro 
235 


Pro 


Arg 


Gly 


Arg 


Leu 
240 


Ser 


Val 


Ala 


Ser 


Thr 
245 


Pro 


He 


Ser 


Gin 


Arg 
250 


Arg 


Ser 


Gin 


Gly 


Arg 
255 


Ser 


Cys 


Gly 


Pro 


Ala 
260 


Ser 


Gin 


Ser 


Thr 


Leu 
265 


Gly 


Leu 


Lys 


Gly 


Ser 
270 


Leu 


Lys 


Arg 


Ser 


Ala 


He 


Ser 


Ala 


Ala 


Lys 


Thr 


Giy 


Val 


Arg 


Phe 


Ser 


Ala 


Ala 
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275 

Thr Lys Asp Asn Glu His 
290 



Lys Ser Ala His 
305 

Val Leu Gly Thr 



Val Thr 
310 



280 

Lys Arg Ser Leu Thr Lys 
295 300 

Val Ser Gly Gly Thr Gin 
315 



Val lie Thr Pro 
340 

Ser Asn Lys Lys 
355 

Leu Asn Tyr Glu 
370 

Lys Glu Asn Asn 
385 

Lys Lys. Thr Tyr 



Lys Lys Arg Glu 
420 

Met Arg Arg Gly 
435 



His Lys 
325 

Phe Lys 

Pro Val 

Pro His 

Tyr Leu 
390 

Lys Gin 
405 

Gin Glu 
Leu He 



Leu Lys 

Leu Thr 

Phe Asp 
360 

Lys Gly 
375 

Asn Gin 
Pro His 
Arg Lys 



Thr He Thr Gly 
330 

Thr Glu Ala Thr 
345 

Leu Lys Ala Ser 



Lys Leu Lys Pro 
380 

His Val Asn Arg 
395 

Leu Gin Thr Lys 
410 

Glu Lys Lys Ala 
425 



285 

Thr Pro Ala Arg 

Lys Gly Glu Ala 
320 

Asn Ser Ala Ala 
335 

Gin Thr Pro Val 
350 

Leu Ser Arg Pro 
365 

Trp Gly Gin Ser 



He Asn Phe Tyr 
400 

Glu Glu Gin Arg 
415 

Lys Val Leu Gly 
430 



Leu Ala 
440 



Glu Asp 



Table 17 displays alignment of hMXl, hMX2 with human myosin (SEQ ID 
NO-31- GenBank AF247457) (Berg et aU 2000). As seen from the alignment, hMXl and 
hMX2 have a likely N-«erminus of MND residues. One of skill in the an can eas.ly 
verify this observation by probing cDNA or genomrc human libranes, or PCR techniques, 

to acquire the full length polynucleotide sequence. 
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12 
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Table 17 Alignment of hMXl, hMX2 and human myosin X 
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10 898 
12 898 



10 948 
12 948 
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io 1048 STVLLAPSVQDSGSLHNS S SGE STYCMPQNAGDL PS PDGDYDYDQDDYED 

12 1048 STVLLAPSVQDSGSLHNS S SGESTYCMPQNAGDL PS PDGDYDYDQDDYED 

humX 1049 STVLLAPSVQDSGSLHNSSSGESTYCMPQNAGDLPS PDGDYDYDQDDYED 

10 1098 GAITSGSSVTFSNSYGSQWSPDYRCSVGTYNSSGAYRFSSEGAQSSFEDS 

12 1098 GAITSGSSVTFSNSYGSQWSPDYRCSVGTYNSSGAYRFSSEGAQSSFEDS 

humX 1099 GAITSGSSVTFSNSYGSQWS PD YRC S VGTYNS SGAYRF S S EGAQS S FEDS 

10 1148 EEDFDSRFDTDDELSYRRDSVYSCVTLPYFHSFLYMKGGLMNSWKRRWCV 

12 1148 EEDFDSRFDTDDELSYRRDSVYSCVTLPYFHSFLYMKGGLMNSWKRRWCV 

humX 1149 EEDFDSRFDTDDELSYRRDSVYSCVTLPYFHSFLYMKGGLMNSWKRRWCV 

10 1198 LKDETFLWFRSKQEALKQGWLHKKGGGSSTLSRRNWKKRWFVLRQSKLMY 

12 1198 LKDETFLWFRSKQEALKQGWLHKKGGGSSTLSRRNWKKRWFVLRQSKLMY 

humX 1199 LKDETFLWFRSKQEALKQGWLHKKGGGSSTLSRRNWKKRWFVLRQSKLMY 

10 1248 F END S EEKLKGTVEVRT AKE 1 1 DNTTKENG I DI IMADRTFHLI AESPEDA 

12 1248 F ENDS EEKLKGTVEVRT AKE 1 1 DNTTKENG I D I IMADRTFHLIAESPEDA 

humX 1249 F ENDS EEKLKGTVEVRTAKE I I DNTTKENGID I IMADRTFHLI AESPEDA 

10 12 98 SQWFSVLSQVHASTDQEIQEMHDEQANPQNAVGTLDVGLIDSVCASDSPD 

12 12 98 SQWFSVLSQVHASTDQEIQEMHDEQANPQNAVGTLDVGLIDSVCASDSPD 

humX 12 99 SQWFSVLSQVHASTDQEIQEMHDEQANPQNAVGTLDVGLIDSVCASDSPD 

10 1348 RPNSFVI ITANRVLHCNADTPEEMHHWITLLQRSKGDTRVEGQEFIVRGW 

12 13 48 RPNSFVI ITANRVLHCNADTPEEMHHWITLLQRSKGDTRVEGQEFIVRGW 

humX 1349 RPNSFVI I TANRVLHCNADT PEEMHHW I TLLQRS KGDTRVEGQEF I VRGW 

10 13 98 LHKEVKNS PKMS SLKLKKRWFVLTHNSLDYYKS SEKNALKLGTLVLNSLC 

12 13 98 LHKEVKNSPKMS SLKLKKRWFVLTHNSLDYYKS SEKNALKLGTLVLNSLC 

humX 1399 LHKEVKNS PKMS SLKLKKRWFVLTHNSLDYYKS SEKNALKLGTLVLNSLC 

10 144 8 SWPPDEKI FKETGYWNVTVYGRKHC YRLYTKLLNEATRWSSVI QNVTDT 

12 1448 SWP PDEKI FKETGYWNVTVYGRKHC YRLYTKLLNEATRWS S VI QNVTDT 

humX 1449 SWP PDEKI FKETGYWNVTVYGRKHCYRLYTKLLNEATRWSSAI QNVTDT 

10 1498 KAPIDTPTQQLIQDIKENCLNSDWEQIYKRNPILRYTHHPLHSPLLPLP 

12 1498 KAPIDTPTQQLIQDIKENCLNSDWEQIYKRNPILRYTHHPLHSPLLPLP 

humX 1499 KAPIDTPTQQLIQDIKENCLNSDWEQIYKRNPILRYTHHPLHSPLLPLP 

10 154 8 YGDINLNLLKDKGYTTLQDEAIKI FNSLQQLESMSDPI PI IQGILQTGHD 

12 154 8 YGDINLNLLKDKGYTTLQDEAIKI FNSLQQLESMSDPI PI IQGILQTGHD 

humX 1549 YGDINLNLLKDKGYTTLQDEAIKIFNSLQQLESMSDPI PI IQGILQTGHD 

10 1598 LRPLRDELYCQLIKQTNKVPHPGSVGNLYSWQILTCLSCTFLPSRGILKY 

12 1598 LRPLRDELYCQLIKQTNKVPHPGSVGNLYSWQILTCLSCTFLPSRGILKY 

humX 1599 LRPLRDELYCQLIKQTNKVPHPGSVGNLYSWQILTCLSCTFLPSRGILKY 

10 1648 LKFHLKRIREQFPGTEMEKYALFTYESLKKTKCREFVPSRDEIEALIHRQ 

12 1648 LKFHLKRIREQFPGTEMEKYALFTYESLKKTKCREFVPSRDEIEALIHRQ 

humX 1649 LKFHLKRIREQFPGTEMEKYALFTYESLKKTKCREFVPSRDEIEALIHRQ 

10 1698 EMT S TVYC HGGG SC K I T I NS HTT AGEWE KL I RGLAMED S RNMF AL F E YN 

12 1698 EMTSTVYCHGGGSCKITINSHTTAGEWEKLIRGLAMEDSRNMFALFEYN 

humX 1699 EMTSTVYCHGGGSCKITINSHTTAGEWEKLIRGLAMEDSRNMFALFEYN 

10 1748 GHVDKAI ES RTVVADVLAKFEKLAAT S EVGDL PWKF YFKL YC FLDTDNVP 

12 1748 GHVDKAI ESRTVVADVLAKFEKLAATS EVGDL PWKF YFKL YC FLDTDNVP 

humX 1749 GHVDKAI ESRTVVADVLAKFEKLAATS EVGDL PWKF YFKL YC FLDTDNVP 

10 17 98 KDSVEFAFMFEQAHEAVIHGHHPAPEENLQVLAALRLQYLQGDYTLHAAI 

12 17 98 KDSVEFAFMFEQAHEAVIHGHHPAPEENLQVLAALRLQYLQGDYTLHAAI 

humX 17 99 KDSVEFAFMFEQAHEAVIHGHHPAPEENLQVLAALRLQYLQGDYTLHAAI 

10 184 8 PPLEEVYSLQRLKARISQSTKTFTPCERLEKRRTSFLEGTLRRSFRTGSV 
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PPLEEVYSLQRLKARISQSTKTFTPCERLEKRRTSFLEGTLRRSFRTGSV 
PPLEEVYSLQRLKARISQSTKTFTPCERLEKRRTSFLEGTLRRSFRTGSV 
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VRQKVEEEQMLDMWI KEEVS SARAS 1 1 DKWRKFQGMNQEQAMAKYMAL I K 
VRQKVEEEQMLDMWI KEEVS SARAS I IDKWRKFQGMNQEQAMAKYMAL I K 
VRQKVEEEQMLDMWI KEEVS SARAS I I DKWRKFQGMNQEQAMAKYMAL I K 
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EWPGYGSTLFDVECKEGGFPQELWIiGVSADAVSVYKRGEGRPLEVFQYEH 
EWPGYGSTLFDVEVRTG-CHVLGWAGCWHLRTWITAKFMWREDKMEHFAL 
EWPGYGSTIiFDVECKEGGFPQELWLGVSADAVSVYKRGEGRPLEVFQYEH 



10 

12 

humX 



1998 
1997 
1999 



ILSFGAPLANTYKIWDERELLFETSEWDVAKLMKAYISMIVKKRYSTT 
STSFFRAPKIVPLTPPFSSQFLFSCWNASVILGMNAKLRCHLFFYPSLG 
ILSFGAPLANTYKIWDERELLFETSEWDVAKLMKAYISMIVKKRYSTT 



10 
12 
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KLr 



humX 2 04 9 RSASSQGSSR 



The invention also includes polypeptides and nucleotides having 80-100%, 
including 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 and 99%, 
sequence identity to SEQ ID NOS:l-16, as well as nucleotides encoding any of these 
polypeptides, and compliments of any of these nucleotides. In an alternative 
embodiment, polypeptides and/or nucleotides (and compliments thereof) identical to any 
one of, or more than one of, SEQ ID NOS:l-16 are excluded. In yet another embodiment, 
polypeptides and/or nucleotides (and compliments thereof) having 81-100% identical, 
including 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98 and 99%, 
sequence identity to SEQ ED NOS:l-16 are excluded. 

The nucleic acids and proteins of the invention are po tentially useful in promot ing 
wound healing, for example after organ transplantation, or in the treatment of myocardial 
infarction, but also in treating tumors, and in cancers, diabetic retinopathy, macular 
degeneration, psoriasis, and rheumatoid arthritis. For example, a cDNA encoding AAP 
may be useful in gene therapy, and AAP proteins may be useful when administered to a 
subject in need thereof. The novel nucleic acid encoding AAP, and the AAP proteins of 
the invention, or fragments thereof, may further be useful in diagnostic applications, 



wherein the presence or amount of the nucleic acid or the protein are to oe assessed. 
These materials are further useful in the generat ionjofAhs>that bind immunospecifically 
to the novel substances of the invention for use in therapeutic or diagnostic methods. 




Kelch-like protein (KLP) 
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The putative protein encoded by KLP contains 1 putative BTB domain and 4 
putative Kelch motifs. The BTB (broad complex, tramtrack, bric-a-brac)/POZ (poxvirus, 
zinc finger) domain is involved in protein protein interactions. The kelch motif is sixfold 
tandem element in the sequence of the Drosophila kelch ORF1 protein that also contains 
BTB. Kelch ORF1 localizes to the ring canals in the egg chamber and helps to organize 
the F-actin cytoskeleton (Adams et al, 2000). The repeated kelch motifs predict a 
conserved tertiary structure, a p-propeller. This module appears in many different 
polypeptide contexts and contains multiple potential protein-protein interaction sites. 
Members of this growing superfamily are present throughout the cell and extracellularly 
and have diverse activities (Adams et aL, 2000). Such activities include cytoskeleton 
organization, as well as other morphological processes, gene expression, interactions with 
viruses, and various extracellular events, such as cell spreading. 

Alignment with Drosophila kelch and other kelch-like proteins, human kelch-like 
protein (GenBank AAF20938 (SEQ ID NO: 17)), hypothetical C elegans (GenBank 
061795 (SEQ ID NO: 18) and the skeletal muscle-specific sarcosin (GenBank 060662 
(SEQ ID NO: 19); (Taylor et al, 1998)) reveals that the disclosed protein (SEQ ID NO:2) 
is a member of a new subfamily 

KLP is associated with tube formation and angiogenesis because it is upregulated 
in the in vitro model of angiogenesis of Example 1 . Kelch mediates cytoskeletal 
associations, it is involved in morphogenetic processes, such as tube formation, that 
depend on cytoskeletal arrangements and signaling. KLP represents an attractive target 
for small molecule drug therapy. 

Human ortholog of mouse BAZF (hBAZF) 

hBAZF (SEQ ID NO:4) is the human ortholog of mouse BAZF (GenBank 
AB011665; SEQ ID NO:20), BAZF is a Bcl-6 (LAZ3) homolog, a transcription repressor 
that controls germinal center formation and the T cell-dependent immune response. 
Expression of Bcl-6 negatively correlates with cellular proliferation: Bcl-6 suppresses 
growth associated with impaired mitotic S phase progression and apoptosis (Albagli et 
aL, 1999). 

BAZF contains a BTB/POZ domain and five repeats of the Kruppel-like zinc 
finger motifs, instead of 6 in Bcl-6 (Okabe et al. 9 1998). Expression of BAZF mRNA is 
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relegated to heart and lung, unlike Bcl-6 mRNA, but is induced in activated lymphocytes 
as an immediate-early gene, like Bcl-6 (Okabe et al., 1998). 

The hBAZF sequence was derived by using tblastn (protein query -translated 
database) (Altschul et al., 1997), with the mouse protein sequence (GenBank 088282; 
SEQ ID NO:21) that has homology to GenBank AC015918 (SEQ ID NO:22), a clone of 
Homo sapiens chromosome 17. Human BAZF contains five Kruppel-like zinc finger 
motif repeats and a BTB/POZ domain. 

The peptide sequence, "RSQ...PQV" that is present in the human sequence, might 
represent an alternative spliced form of the gene. Aligment with mouse BAZF, and 
aligment with mouse and human Bcl-6 demonstrates that the four proteins are almost 
identical in this region, but only human BAZF has this inserted sequence. 

hBAZF is upregulated in HUVE cells grown embedded in collagen gels but not as 
a monolayer grown on collagen. When HUVE cells are suspended in collagen, they do 
not proliferate. Analagous to the role of mBAZF plays a role in regulating cell 
proliferation (Okabe et al 9 1998), hBAZF plays a roll in cell proliferation in HUVE 
suspended in collagen. Because of its high expression during vessel morphogenesis, 
hBAZF represents an. excellent molecular marker, as well as an attractive target for 
various therapies to inhibit angiogenesis. 

hmt- Elongation Factor G (hEF-G) 

The original isolation of hEF-G (SEQ ID NO:6) is 84% identical and colinear 
with Rattus norvegicus nuclear encoded mitochondrial elongation factor G (GenBank 
L14684 (SEQ ID NO:23); (Barker et a/., 1993). No human gene is described in 
GenBank. However, searching EST databases, the human gene is contained inside 
GenBank AC010936 (SEQ ID NO:24), a chromosome 3 clone. Aligment of hEF-G with 
rat mtEF-G and yeast EF-G1 demonstrates that the novel sequence is the ortholog of rat 
nuclear-encoded mitochondrial elongation factor G. 

Bacterial elongation factor G (EF-G) physically associates with translocation- 
competent ribosomes and facilitates transition to the subsequent codon through the 
coordinate binding and hydrolysis of GTP. The deduced amino acid sequence of hmt-EF- 
G reveals characteristic motifs shared by all GTP binding proteins. Therefore, similarly 

l 

\ 
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to other elongation factors, the enzymatic function of hmt-EF-G is predicted to depend on 
GTP binding and hydrolysis. 

Hmt-EF-G is strongly induced (30-fold) in an in vitro model of angiogenesis 
(Example 1), and as such, hmt-EF-G represents an excellent molecular marker for vessel 
formation. Because of its putative localization to the mitochondrion, hmt-EF-G is also an 
attractive therapeutic target to treat disease states associated with mitochondrial 
dysfunction. 

Human thyroid regulated transcript (hTRG) t 

hTRG (SEQ ID NO:8) is the human ortholog of rat TRG, a novel thyroid 
transcript negatively regulated by TSH (GenBank KIAA1058 (SEQ ID NO:25); 
(Bonapace et aL, 1990). 

SEQ ID NO:25 appears to be a partial peptide since there are C elegans 
homologous proteins of 2000 residues. Using tblastn (Altschul et aL, 1997) against 
genomic sequences, the hTRG sequence (SEQ ID NO: 8) was assembled. 

In C. elegans, homologous proteins localize either to the plasma membrane or to 
the mitochondrial inner membrane. A partial sequence, KIAA0694 (SEQ ID NO:26) also 
localizes to the mitochondrial matrix. hTRG has a PH domain, and has weak homology 
to an extracellular fibronectin-binding protein precursor. SEQ ID NO:26 has homology 
to Drosophila DOS and mouse Gab-2 proteins; both of which are involved in signal 
transduction, acting as adapter proteins between receptors and kinases like Rasl (Hibi and 
Hirano, 2000). 

Because of hTRG is upregulated during the in vitro model of angiogenesis 
(Example 1), and because of its homologies with adapter proteins, hTRG is likely to be 
involved in signal transduction between receptors and kinases. As such, hTRG represent 
an excellent candidate for small molecule drug therapy to modulate angiogenesis and treat 
angiogenesis-related diseases. In addition, because of its putative ability to respond to 
thyroid stimulating hormone (TSH), modulation of hTRG is useful to treat diseases 
related to TSH imbalance. 



Human myosin X (hMXl(SEQ ID NO: 10) and hMX2 (SEQ ID NO: 12) 
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The hMX proteins represent the human ortholog of bovine myosin X, (GenBank 
AAB39486; SEQ ID NO:27). Using tblastn (Altschul et aL, 1997) and the bovine 
sequence, a series of genomic clones from human chromosome 5 were identified; 
GenBank AC010310 (SEQ ID NO:28) appears to contain the entire sequence. 
Interestingly, a partial cDNA sequence from mouse (GenBank API 84 153; SEQ ID 
NO:29) localizes to a 0.8 cM interval on the short arm of chromosome 5, between the 
polymorphic microsatellite markers D5S416 and D5S21 14. In this region lies the gene 
for familial chondrocalcinosis (CCAL2) (Rojas et al, 1999). 

Another GenBank entry, AB018342 (SEQ ID NO:30) that represents the 3' region 
of hMX, appears to encode an alternative splice form. Noteworthy, this variant (hMX2) 
has a very hydrophobic carboxy terminus, while the more prevalent form (hMXl) is 
hydrophilic and potentially interacts with DNA/RNA since it has homology to high 
mobility group box (HMG) and ribosomal proteins. Additionaly, a myosin head domain 
was found in the NH terminus, as well as a myosin talin domain, two calmodulin binding 
domains, four pleckstrin domains and a band 4.1 domain. 

The band 4.1 domain represents a crossroads between cytoskeletal organization 
and signal transduction. The domain was first described in the red blood cell protein band 
4.1. The ERM proteins ezrin, radixin, and moesin and the unconventional myosins Vila 
and X all possess the band 4.1 domain (Lou vet- Val lee, 2000). The band 4.1 domain 
binds single transmembrane protein at the membrane-proximal region in the C-terminal 
cytoplasmic tail. 

HMX is upregulated during angiogenesis in an in vitro model (Example 1). 
Because hMX contains the protein-protein interaction domains PH and band 4.1 domain, 
hMXl and hMX2 are involved in angiogenesis, likely transducing signals from 
angiogenic factors, perhaps modulating the cytoskeleton. 

Human mitochondrial protein (hMP) 

Analysis of hMP (SEQ ID NO: 14) reveals several subdomain that are homologous 
to proteins involved in transport across membranes, K + ATPase a and y chains. Further 
analysis indicates that hMP may bind DNA and or RNA, since hMP is homologous to 
histones and transcription factors, especially those possessing basic region plus leucine 
zipper domains. 
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Although PSORT analysis (Nakai and Horton, 1999) predicts nuclear localization 
(P=.6), hMP may in fact be a nuclear-encoded mitochondrial protein. Homologies with 
mostly bacterial proteins and a PSORT prediction ofmitochondrial matrix space 
localization (P=0.4478) strongly support this contention. 

Because hMP is upregulated in an in vitro model of angiogenesis (Example 1), 
and because of its homologies with mitochondrial and nuclear-localized polypeptides, 
hMP is important in vascular morphogenesis, most likely through either powering the 
cellular differentiation-redifferentiation process, and/or affecting changes in the nuclear 
matrix that change global gene expression. Alternatively, hMP may be a transcription 
factor for either the nuclear or mitochondrial genomes. 

Nuclear hormone receptor (NHR) 

NHR (SEQ ID NO: 16) has two domains: (1) the NH region is similar to Swi3 
(yeast SWI/SNF complexes regulate transcription by chromatin remodeling), indicating a 
role in transcriptional regulation, and (2) the COOH region is similar to parathyroid 
hormone-related proteins that bind parathyroid hormones. PSORT (Nakai and Horton, 
1999) predicts the protein to localize in the nucleus P=0.9600. 

The identification of this new putative hormone receptor-transcriptional regulator 
and hBAZF suggest a novel human transcriptional pathway that resembles, to some 
extent, that of Bcl-6. 

Bcl-6 suppresses transcription via the BTB domain, which recruits a complex 
containing SMRT, retinoid thyroid hormone receptor, nuclear receptor corepressor (N- 
CoR), mammalian Sin3A, and histone deacetylase (HDAC). hBAZF, which also 
possesses a BTB domain, might recruit a similar complex containing deacetylase. 
Expression data indicate that hBAZF is up-regulated while NHR is down-regulated. 
These data agree with other evidence related to tube formation. Testosterone (a steroid) 
and dexamethasone (a steroid-like molecule) strongly inhibit vessel formation, and all- 
trans retinoic acid (at-RA) and 9-cis retinoic acid (9-cis RA) stimulate capillary-like 
tubular structures (Lansink et aL, 1998). 

Upon angiogenic stimulation, endothelial cells may become incompetent to 
respond to anti-angiogenic responses mediated by hormones using a dual mechanism, 
sequestering hormones and suppressing transcription. Because nHR is down-regulated 
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during in vitro angiogenesis (Example 1), this polypeptide is likely to be involved in non- 
angiogenesis-specific gene transcription. nHR is an attractive therapeutic target, 
especially in therapies that are directed at preventing vascularization. 

AAP polynucleotides 

One aspect of the invention pertains to isolated nucleic acid molecules that encode 
AAP or biologically-active portions thereof. Also included in the invention are nucleic 
acid fragments sufficient for use as hybridization probes to identify AAP-encoding 
nucleic acids {e.g., AAP mRNAs) and fragments for use as polymerase chain reaction 
(PCR) primers for the amplification and/or mutation of AAP molecules. A "nucleic acid 
molecule" includes DNA molecules (e.g., cDNA or genomic DNA), RNA molecules 
(e.g., mRNA), analogs of the DNA or RNA generated using nucleotide analogs, and 
derivatives, fragments and homologs. The nucleic acid molecule may be single-stranded 
or double-stranded, but preferably comprises double-stranded DNA. 

1 . probes 

Probes are nucleic acid sequences of variable length, preferably between at least 
about 10 nucleotides (nt), 100 nt, or many (e.g., 6,000 nt) depending on the specific use. 
Probes are used to detect identical, similar, or complementary nucleic acid sequences. 
Longer length probes can be obtained from a natural or recombinant source, are highly 
specific, and much slower to hybridize than shorter-length oligomer probes. Probes may 
be single- or double-stranded and designed to have specificity in PCR, membrane-based 
hybridization technologies, or ELIS A-like technologies. Probes are substantially purified 
oligonucleotides that will hybridize under stringent conditions to at least optimally 12, 25, 
50, 100, 150, 200, 250, 300, 350 or 400 consecutive sense strand nucleotide sequence of 
SEQ ID NOS:l, 3, 5, 7, 9, 1 1, 13, or 15; or an anti-sense strand nucleotide sequence of 
these sequences; or of a naturally occurring mutant of these sequences. 

The full- or partial length native sequence AAP may be used to "pull out" similar 
(homologous) sequences (Ausubel etal, 1987; Sambrook, 1989), such as: (1) full-length 
or fragments of AAP cDNA from a cDNA library from any species (e.g. human, murine, 
feline, canine, bacterial, viral, retroviral, yeast), (2) from cells or tissues, (3) variants 
within a species, and (4) homologues and variants from other species. To find related 
sequences that may encode related genes, the probe may be designed to encode unique 
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sequences or degenerate sequences. Sequences may also be genomic sequences including 
promoters, enhancer elements and introns of native sequence AAP. 

For example, an AAP coding region in another species may be isolated using such 
probes. A probe of about 40 bases is designed, based on an AAP, and made. To detect 

32 35 

hybridizations, probes are labeled using, for example, radionuclides such as or JJ S, or 
enzymatic labels such as alkaline phosphatase coupled to the probe via avidin-biotin 
systems. Labeled probes are used to detect nucleic acids having a complementary 
sequence to that of an AAP in libraries of cDNA, genomic DNA or mRNA of a desired 
species. 

Such probes can be used as a part of a diagnostic test kit for identifying cells or 
tissues which mis-express an AAP, such as by measuring a level of an AAP in a sample of 
cells from a subject e.g., detecting AAP mRNA levels or determining whether a genomic 
AAP has been mutated or deleted 

2. isolated nucleic acid 

An isolated nucleic acid molecule is separated from other nucleic acid molecules 
that are present in the natural source of the nucleic acid. Preferably, an isolated nucleic 
acid is free of sequences that naturally flank the nucleic acid (i.e., sequences located at the 
5'- and 3'-termini of the nucleic acid) in the genomic DNA of the organism from which 
the nucleic acid is derived. For example, in various embodiments, isolated AAP 
molecules can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb of 
nucleotide sequences which naturally flank the nucleic acid molecule in genomic DNA of 
the cell/tissue from which the nucleic acid is derived (e.g., brain, heart, liver, spleen, etc.). 
Moreover, an isolated nucleic acid molecule, such as a cDNA molecule, can be 
substantially free of other cellular material or culture medium when produced by 
recombinant techniques, or of chemical precursors or other chemicals when chemically 
synthesized. 

A nucleic acid molecule of the invention, e.g., a nucleic acid molecule having the 
nucleotide sequence of SEQ ED NOS:l, 3, 5, 7, 9, 11, 13 or 15, or a complement of this 
aforementioned nucleotide sequence, can be isolated using standard molecular biology 
techniques and the provided sequence information. Using all or a portion of the nucleic 
acid sequence of SEQ ID NOS:l, 3, 5, 7, 9, 11, 13 or 15 as a hybridization probe, AAP 
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molecules can be isolated using standard hybridization and cloning techniques (Ausubel 
et al., 1987; Sambrook, 1989). 

PCR amplification techniques can be used to amplify AAP using cDNA, mRNA 
or alternatively, genomic DNA, as a template and appropriate oligonucleotide primers. 
Such nucleic acids can be cloned into an appropriate vector and characterized by DNA 
sequence analysis. Furthermore, oligonucleotides corresponding to AAP sequences can 
be prepared by standard synthetic techniques, e.g. , an automated DNA synthesizer. 

3. oligonucleotide 

An oligonucleotide comprises a series of linked nucleotide residues, which 
oligonucleotide has a sufficient number of nucleotide bases to be used in a PCR reaction 
or other application. A short oligonucleotide sequence may be based on, or designed 
from, a genomic or cDNA sequence and is used to amplify, confirm, or reveal the 
presence of an identical, similar or complementary DNA or RNA in a particular cell or 
tissue. Oligonucleotides comprise portions of a nucleic acid sequence having about 10 nt, 
50 nt, or 100 nt in length, preferably about 15 nt to 30 nt in length. In one embodiment of 
the invention, an oligonucleotide comprising a nucleic acid molecule less than 100 nt in 
length would further comprise at least 6 contiguous nucleotides of SEQ ID NOS:l, 3, 5, 
7, 9, 1 1, 13 or 15, or a complement thereof. Oligonucleotides may be chemically 
synthesized and may also be used as probes. 

4. complementary nucleic acid sequences; binding 

In another embodiment, an isolated nucleic acid molecule of the invention 
comprises a nucleic acid molecule that is a complement of the nucleotide sequence shown 
in SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13 or 15, or a portion of this nucleotide sequence {e.g., a 
fragment that can be used as a probe or primer or a fragment encoding a biologically- 
active portion of an AAP). A nucleic acid molecule that is complementary to the 
nucleotide sequence shown in SEQ ID NOS:l, 3, 5, 7, 9, 11, 13 or 15, is one that is 
sufficiently complementary to the nucleotide sequence shown in SEQ ID NOS:l, 3, 5, 7, 
9, 1 1, 13 or 15, that it can hydrogen bond with little or no mismatches to the nucleotide 
sequence shown in SEQ ID NOS:l, 3, 5, 7, 9, 11, 13 or 15, thereby forming a stable 
duplex. 

"Complementary" refers to Watson-Crick or Hoogsteen base pairing between 
nucleotides units of a nucleic acid molecule, and the term "binding" means the physical 
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or chemical interaction between two polypeptides or compounds or associated 
polypeptides or compounds or combinations thereof. Binding includes ionic, non-ionic, 
van der Waals, hydrophobic interactions, and the like. A physical interaction can be 
either direct or indirect. Indirect interactions may be through or due to the effects of 
another polypeptide or compound Direct binding refers to interactions that do not take 
place through, or due to, the effect of another polypeptide or compound, but instead are 
without other substantial chemical intermediates. 

Nucleic acid fragments are at least 6 (contiguous) nucleic acids or at least 4 
(contiguous) amino acids, a length sufficient to allow for specific hybridization in the 
case of nucleic acids or for specific recognition of an epitope in the case of amino acids, 
respectively, and are at most some portion less than a full-length sequence. Fragments 
may be derived from any contiguous portion of a nucleic acid or amino acid sequence of 
choice. 

5. derivatives, and analogs 

Derivatives are nucleic acid sequences or amino acid sequences formed from the 
native compounds either directly or by modification or partial substitution. Analogs are 
nucleic acid sequences or amino acid sequences that have a structure similar to, but not 
identical to, the native compound but differ from it in respect to certain components or 
side chains. Analogs may be synthetic or from a different evolutionary origin and may 
have a similar or opposite metabolic activity compared to wild type. Homologs are 
nucleic acid sequences or amino acid sequences of a particular gene that are derived from 
different species. 

Derivatives and analogs may be full length or other than full length, if the 
derivative or analog contains a modified nucleic acid or amino acid, as described below. 
Derivatives or analogs of the nucleic acids or proteins of the invention include, but are not 
limited to, molecules comprising regions that are substantially homologous to the nucleic 
acids or proteins of the invention, in various embodiments, by at least about 70%, 80%, or 
95% identity (with a preferred identity of 80-95%) over a nucleic acid or amino acid 
sequence of identical size or when compared to an aligned sequence in which the 
alignment is done by a computer homology program known in the art, or whose encoding 
nucleic acid is capable of hybridizing to the complement of a sequence encoding the 
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aforementioned proteins under stringent, moderately stringent, or low stringent conditions 
(Ausubel et al y 1987). 

6. homology 

A "homologous nucleic acid sequence" or "homologous amino acid sequence," or 
variations thereof, refer to sequences characterized by homology at the nucleotide level or 
amino acid level as discussed above. Homologous nucleotide sequences encode those 
sequences coding for isoforms of AAP. Isoforms can be expressed in different tissues of 
the same organism as a result of, for example, alternative splicing of RNA. Alternatively, 
different genes can encode isoforms. In the invention, homologous nucleotide sequences 
include nucleotide sequences encoding for an AAP of species other than humans, 
including, but not limited to: vertebrates, and thus can include, e.g., frog, mouse, rat, 
rabbit, dog, cat cow, horse, and other organisms. Homologous nucleotide sequences also 
include, but are not limited to, naturally occurring allelic variations and mutations of the 
nucleotide sequences set forth herein. A homologous nucleotide sequence does not, 
however, include the exact nucleotide sequence encoding human AAP. Homologous 
nucleic acid sequences include those nucleic acid sequences that encode conservative 
amino acid substitutions (see below) in SEQ ID NOS:2, 4, 6, 8, 10, 12, 14 or 16, as well 
as a polypeptide possessing AAP biological activity. Various biological activities of the 
AAP are described below. 

7. open reading frames 

The open reading frame (ORF) of an AAP gene encodes an AAP. An ORF is a 
nucleotide sequence that has a start codon (ATG) and terminates with one of the three 
"stop" codons (TAA, TAG, or TGA). In this invention, however, an ORF may be any 
part of a coding sequence that may or may not comprise a start codon and a stop codon. 
To achieve a unique sequence, preferable AAP ORFs encode at least 50 amino acids. 

AAP polypeptides 

1 . mature 

An AAP can encode a mature AAP. A "mature" form of a polypeptide or protein 
disclosed in the present invention is the product of a naturally occurring polypeptide or 
precursor form or proprotein. The naturally occurring polypeptide, precursor or 
proprotein includes, by way of nonlimiting example, the full-length gene product, 
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encoded by the corresponding gene. Alternatively, it may be defined as the polypeptide, 
precursor or proprotein encoded by an open reading frame described herein. The product 
"mature" form arises, again by way of nonlimiting example, as a result of one or more 
naturally occurring processing steps as they may take place within the cell, or host cell, in 
which the gene product arises. Examples of such processing steps leading to a "mature" 
form of a polypeptide or protein include the cleavage of the N-terminal methionine 
residue encoded by the initiation codon of an open reading frame, or the proteolytic 
cleavage of a signal peptide or leader sequence. Thus a mature form arising from a 
precursor polypeptide or protein that has residues 1 to N, where residue 1 is the N- 
terminal methionine, would have residues 2 through N remaining after removal of the N- 
terminal methionine. Alternatively, a mature form arising from a precursor polypeptide 
or protein having residues 1 to N, in which an N-terminal signal sequence from residue 1 
to residue M is cleaved, would have the residues from residue M+l to residue N 
remaining. Further as used herein, a "mature" form of a polypeptide or protein may arise 
from a step of post-translational modification other than a proteolytic cleavage event. 
Such additional processes include, by way of non-limiting example, glycosylation, 
myristoylation or phosphorylation. In general, a mature polypeptide or protein may result 
from the operation of only one of these processes, or a combination of any of them. 
2. active 

An active AAP polypeptide or AAP polypeptide fragment retains a biological 
and/or an immunological activity similar, but not necessarily identical, to an activity of a 
naturally-occuring (wild-type) AAP polypeptide of the invention, including mature forms. 
A particular biological assay, with or without dose dependency, can be used to determine 
AAP activity. A nucleic acid fragment encoding a biologically-active portion of AAP can 
be prepared by isolating a portion of SEQ ID NOS:l, 3, 5, 7, 9, 11, 13 or 15 that encodes 
a polypeptide having an AAP biological activity (the biological activities of the AAP are 
described below), expressing the encoded portion of AAP (e.g., by recombinant 
expression in vitro) and assessing the activity of the encoded portion of AAP. 
Immunological activity refers to the ability to induce the production of an antibody 
against an antigenic epitope possessed by a native AAP; biological activity refers to a 
function, either inhibitory or stimulatory, caused by a native AAP that excludes 
immunological activity. 
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AAP nucleic acid variants and hybridization 

1. variant polynucleotides, genes and recombinant genes The invention 
further encompasses nucleic acid molecules that differ from the nucleotide sequences 
shown in SEQ ID NOS.l, 3, 5, 7, 9, 11, 13 or 15 due to degeneracy of the genetic code 
and thus encode the same AAP as that encoded by the nucleotide sequences shown in 
SEQ ID NO NOS:l, 3, 5, 7, 9, 11, 13 or 15. An isolated nucleic acid molecule of the 
invention has a nucleotide sequence encoding a protein having an amino acid sequence 
shown in SEQ ID NOS:2, 4, 6, 8, 10, 12, 14 or 16. 

In addition to the AAP sequences shown in SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13 or 
15, DNA sequence polymorphisms that change the amino acid sequences of the AAP may 
exist within a population. For example, allelic variation among individuals will exhibit 
genetic polymorphism in an AAP, The terms "gene" and "recombinant gene" refer to 
nucleic acid molecules comprising an open reading frame (ORF) encoding an AAP, 
preferably a vertebrate AAP. Such natural allelic variations can typically result in 1-5% 
variance in an AAP. Any and all such nucleotide variations and resulting amino acid 
polymorphisms in an AAP, which are the result of natural allelic variation and that do not 
alter the functional activity of an AAP are within the scope of the invention. 

Moreover, AAP from other species that have a nucleotide sequence that differs 
from the human sequence of SEQ ID NOS:l, 3, 5, 7, 9, 11, 13 or 15, are contemplated 
Nucleic acid molecules corresponding to natural allelic variants and homologues of an 
AAP cDNAs of the invention can be isolated based on their homology to an AAP of SEQ 
ID NOS:l, 3, 5, 7, 9, 11, 13 or 15 using cDNA-derived probes to hybridize to 
homologous AAP sequences under stringent conditions. 

"AAP variant polynucleotide" or "AAP variant nucleic acid sequence" means a 
nucleic acid molecule which encodes an active AAP that (1) has at least about 80% 
nucleic acid sequence identity with a nucleotide acid sequence encoding a full-length 
native AAP, (2) a full-length native AAP lacking the signal peptide, (3) an extracellular 
domain of an AAP, with or without the signal peptide, or (4) any other fragment of a full- 
length AAP. Ordinarily, an AAP variant polynucleotide will have at least about 80% 
nucleic acid sequence identity, more preferably at least about 81%, 82%, 83%, 84%, 
85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% nucleic 
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acid sequence identity and yet more preferably at least about 99% nucleic acid sequence 
identity with the nucleic acid sequence encoding a full-length native A AR An AAP 
variant polynucleotide may encode a full-length native AAP lacking the signal peptide, an 
extracellular domain of an AAP, with or without the signal sequence, or any other 
fragment of a full-length AAP. Variants do not encompass the native nucleotide 
sequence. 

Ordinarily, AAP variant polynucleotides are at least about 30 nucleotides in 
length, often at least about 60, 90, 120, 150, 180, 210, 240, 270, 300, 450, 600 
nucleotides in length, more often at least about 900 nucleotides in length, or more. 

"Percent (%) nucleic acid sequence identity" with respect to AAP-encoding 
nucleic acid sequences identified herein is defined as the percentage of nucleotides in a 
candidate sequence that are identical with the nucleotides in the AAP sequence of interest, 
after aligning the sequences and introducing gaps, if necessary, to achieve the maximum 
percent sequence identity. Alignment for purposes of determining % nucleic acid 
sequence identity can be achieved in various ways that are within the skill in the art, for 
instance, using publicly available computer software such as BLAST, BLAST-2, ALIGN 
or Megalign (DNASTAR) software. Those skilled in the art can determine appropriate 
parameters for measuring alignment, including any algorithms needed to achieve 
maximal alignment over the full length of the sequences being compared. 

When nucleotide sequences are aligned, the % nucleic acid sequence identity of a 
given nucleic acid sequence C to, with, or against a given nucleic acid sequence D (which 
can alternatively be phrased as a given nucleic acid sequence C that has or comprises a 
certain % nucleic acid sequence identity to, with, or against a given nucleic acid sequence 
D) can be calculated as follows: 

%nucleic acid sequence identity = W/Z * 100 

where 

W is the number of nucleotides cored as identical matches by the sequence 
alignment program's or algorithm's alignment of C and D 
and 

Z is the total number of nucleotides in D. 
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When the length of nucleic acid sequence C is not equal to the length of nucleic 
acid sequence D, the % nucleic acid sequence identity of C to D will not equal the % 
nucleic acid sequence identity of D to C. 
2. Stringency 

Homologs (i.e., nucleic acids encoding an AAP derived from species other than 
human) or other related sequences (e.g., paralogs) can be obtained by low, moderate or 
high stringency hybridization with all or a portion of the particular human sequence as a 
probe using methods well known in the art for nucleic acid hybridization and cloning. 

The specificity of single stranded DNA to hybridize complementary fragments is 
determined by the "stringency" of the reaction conditions. Hybridization stringency 
increases as the propensity to form DNA duplexes decreases. In nucleic acid 
hybridization reactions, the stringency can be chosen to either favor specific 
hybridizations (high stringency), which can be used to identify, for example, full-length 
clones from a library. Less-specific hybridizations (low stringency) can be used to 
identify related, but not exact, DNA molecules (homologous, but not identical) or 
segments. 

DNA duplexes are stabilized by: (1) the number of complementary base pairs, (2) 
the type of base pairs, (3) salt concentration (ionic strength) of the reaction mixture, (4) 
the temperature of the reaction, and (5) the presence of certain organic solvents, such as 
formamide which decreases DNA duplex stability. In general, the longer the probe, the 
higher the temperature required for proper annealing. A common approach is to vary the 
temperature: higher relative temperatures result in more stringent reaction conditions. 
(Ausubel et a/., 1987) provide an excellent explanation of stringency of hybridization 
reactions. 

To hybridize under "stringent conditions" describes hybridization protocols in 
which nucleotide sequences at least 60% homologous to each other remain hybridized 
Generally, stringent conditions are selected to be about 5°C lower than the thermal 
melting point (Tm) for the specific sequence at a defined ionic strength andpH. The Tm 
is the temperature (under defined ionic strength, pH and nucleic acid concentration) at 
which 50% of the probes complementary to the target sequence hybridize to the target 
sequence at equilibrium. Since the target sequences are generally present at excess, at 
Tm, 50% of the probes are occupied at equilibrium. 
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(a) high stringency 

"Stringent hybridization conditions" conditions enable a probe, primer or 
oligonucleotide to hybridize only to its target sequence. Stringent conditions are 
sequence-dependent and will differ. Stringent conditions comprise: (1) low ionic strength 
and high temperature washes (e.g. 15 mM sodium chloride, 1.5 mM sodium citrate, 0.1 % 
sodium dodecyl sulfate at 50°C); (2) a denaturing agent during hybridization (e.g. 50% 
(v/v) formamide, 0.1% bovine serum albumin, 0.1% Ficoll, 0.1% polyvinylpyrrolidone, 
50mM sodium phosphate buffer (pH 6.5; 750 mM sodium chloride, 75 mM sodium 
citrate at 42°C); or (3) 50% formamide. Washes typically also comprise 5X SSC (0.75 M 
NaCl, 75 mM sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1% sodium 
pyrophosphate, 5 x Denhardt's solution, sonicated salmon sperm DNA (50 ug/ml), 0.1% 
SDS, and 10% dextran sulfate at 42°C, with washes at 42°C in 0.2 x SSC (sodium 
chloride/sodium citrate) and 50% formamide at 55°C, followed by a high-stringency wash 
consisting of 0.1 x SSC containing EDTA at 55°C. Preferably, the conditions are such 
that sequences at least about 65%, 70%, 75%, 85%, 90%, 95%, 98%, or 99% homologous 
to each other typically remain hybridized to each other. These conditions are presented as 
examples and are not meant to be limiting. 

(b) moderate stringency 

"Moderately stringent conditions" use washing solutions and hybridization 
conditions that are less stringent (Sambrook, 1989), such that a polynucleotide will 
hybridize to the entire, fragments, derivatives or analogs of SEQ ID NOS:l, 3, 5, 7, 9, 1 1, 
13 or 15. One example comprises hybridization in 6X SSC, 5X Denhardt's solution, 0.5% 
SDS and 100 mg/ml denatured salmon sperm DNA at 55°C, followed by one or more 
washes in IX SSC, 0. 1% SDS at 37°C. The temperature, ionic strength, etc., can be 
adjusted to accommodate experimental factors such as probe length. Other moderate 
stringency conditions are described in (Ausubel et ai, 1987; Kriegler, 1990). 

(c) low stringency 

"Low stringent conditions" use washing solutions and hybridization conditions 
that are less stringent than those for moderate stringency (Sambrook, 1989), such that a 
polynucleotide will hybridize to the entire, fragments, derivatives or analogs of SEQ ID 
NOS:l,3,5,7,9, 11, 13 or 15. A non-limiting example of low stringency hybridization 
conditions are hybridization in 35% formamide, 5X SSC, 50 mM Tris-HCl (pH 7.5), 5 
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mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.2% BSA, 100 mg/ml denatured salmon sperm 
DNA, 10% (wt/vol) dextran sulfate at 40°C, followed by one or more washes in 2X SSC, 
25 mM Tris-HCl (pH 7.4), 5 mM EDTA, and 0.1% SDS at 50°C. Other conditions of low 
stringency, such as those for cross-species hybridizations are described in (Ausubel et al., 
1987; Kriegler, 1990; Shilo and Weinberg, 1981). 
3. Conservative mutations 

In addition to naturally-occurring allelic variants of AAP, changes can be 
introduced by mutation into SEQ ID NO NOS:l, 3, 5, 7, 9, 11, 13 or 15 sequences that 
incur alterations in the amino acid sequences of the encoded AAP that do not alter the 
AAP function. For example, nucleotide substitutions leading to amino acid substitutions 
at "non-essential" amino acid residues can be made in the sequence of SEQ ID NOS:2, 4, 
6, 8, 10, 12, 14 or 16. A "non-essential" amino acid residue is a residue that can be 
altered from the wild-type sequences of the AAP without altering their biological activity, 
whereas an "essential" amino acid residue is required for such biological activity. For 
example, amino acid residues that are conserved among the AAP of the invention are 
predicted to be particularly non-amenable to alteration. Amino acids for which 
conservative substitutions can be made are well-known in the art. 

Useful conservative substitutions are shown in Table A, "Preferred substitutions." 
Conservative substitutions whereby an amino acid of one class is replaced with another 
amino acid of the same type fall within the scope of the subject invention so long as the 
substitution does not materially alter the biological activity of the compound. If such 
substitutions result in a change in biological activity, then more substantial changes, 
indicated in Table B as exemplary are introduced and the products screened for an AAP 
polypeptide's biological activity. 



Table A Preferred substitutions 



Original residue 


Exemplary substitutions 


Preferred substitutions 


Ala (A) 


Val, Leu, He 


Val 


Arg (R) 


Lys, Gin, Asn 


Lys 


Asn (N) 


Gin, His, Lys, Arg 


Gin 


Asp (D) 


Glu 


Glu 


Cvs (C) 


Ser 


Ser 


Gin (O) 


Asn 


Asn 


Glu (E) 


Asp 


Asp 
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Gly (G) 


Prn A 1 a 


Ala 


His (H) 


v"voii, VJ1I1, L^ys, /\rg 


Arg 


He (I) 


I.£ii Val lVfpt Ala PKs 
x-<t>u, vai, lvici, /\ldj rile, 

Norlpnr*inf» 


Leu 


Leu (L) 


Norleucine Tie Val M*»t Ala 

xiviivuvillW) Ills, V<21, IVlClj /Aid, 

Phe 


He 


Lys (K) 


Ar<* Gin Asn 


Arg 


Met (M) 


Leu, Phe, He 


Leu 


Phe (F) 


Leu, Val, lie. Ala. Tvr 


Leu 


Pro (P) 


Ala 


Ala 


Ser (S) 


Thr 


Thr 


Thr (T) 


Ser 


Ser 


Trp (W) 


Tyr, Phe 


Tyr 


Tyr(Y) 


Trp, Phe, Thr, Ser 


Phe 


Val (V) 


He, Leu, Met, Phe, Ala, 
Norleucine 


Leu 



Non-conservative substitutions that effect (1) the structure of the polypeptide 
backbone, such as a p-sheet or ct-helical conformation, (2) the charge or (3) 
hydrophobicity, or (4) the bulk of the side chain of the target site can modify an AAP 
polypeptide's function or immunological identity. Residues are divided into groups based 
on common side-chain properties as denoted in Table B. Non-conservative substitutions 
entail exchanging a member of one of these classes for another class. Substitutions may 
be introduced into conservative substitution sites or more preferably into non-conserved 
sites. 



Table B Amino acid classes 



Class 


Amino acids 


hydrophobic 


Norleucine, Met, Ala, Val, Leu, He 


neutral hydrophilic 


Cys, Ser, Thr 


acidic 


Asp, Glu 


basic 


Asn, Gin, His, Lys, Arg 


disrupt chain conformation 


Gly, Pro 


aromatic 


Trp, Tyr, Phe 



The variant polypeptides can be made using methods known in the art such 
oligonucleotide-mediated (site-directed) mutagenesis, alanine scanning, and PCR 
mutagenesis. Site-directed mutagenesis (Carter, 1986; Zoller and Smith, 1987), ca; 
mutagenesis, restriction selection mutagenesis (Wells et ah, 1985) or other known 
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techniques can be performed on the cloned DNA to produce the AAP variant DNA 
(Ausubel etal., 1987; Sambrook, 1989). 

In one embodiment, the isolated nucleic acid molecule comprises a nucleotide 
sequence encoding a protein, wherein the protein comprises an amino acid sequence at 
least about 45%, preferably 60%, more preferably 70%, 80%, 90%, andmost preferably 
about 95% homologous to SEQ ID NOS:l, 3, 5, 7, 9, 11, 13, or 15. 

A mutant AAP can be assayed for blocking angiogenesis in vitro. 

4. Anti-sense nucleic acids 

Using antisense and sense AAP oligonucleotides can prevent AAP polypeptide 
expression. These oligonucleotides bind to target nucleic acid sequences, forming 
duplexes that block transcription or translation of the target sequence by enhancing 
degradation of the duplexes, terminating prematurely transcription or translation, or by 
other means. 

Antisense or sense oligonucleotides are singe-stranded nucleic acids, either RNA 
or DNA, which can bind a target AAP mRNA (sense) or an AAP DNA (antisense) 
sequences. Anti-sense nucleic acids can be designed according to Watson and Crick or 
Hoogsteen base pairing rules. The anti-sense nucleic acid molecule can be 
complementary to the entire coding region of an AAP mRNA, but more preferably, to 
only a portion of the coding or noncoding region of an AAP mRNA. For example, the 
anti-sense oligonucleotide can be complementary to the region surrounding the translation 
start site of an AAP mRNA. Antisense or sense oligonucleotides may comprise a 
fragment of the AAP DNA coding region of at least about 14 nucleotides, preferably from 
about 14 to 30 nucleotides. In general, antisense RNA or DNA molecules can comprise 
at least 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100 bases in 
length or more. Among others, (Stein and Cohen, 1988; van der Krol et ai, 1988a) 
describe methods to derive antisense or a sense oligonucleotides from a given cDNA 
sequence. 

Examples of modified nucleotides that can be used to generate the anti-sense 
nucleic acid include: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, 
hypoxanthine, xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5- 
carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluracil, 
dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 1- 
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methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2- 
methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 5- 
methylaminomethyluracil, 5-methoxyanunomethyl-2-thiouracil, beta-D- 
mannosylqueosine, S'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N6- 
isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, queosine, 2- 
thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, uracil-5- 
oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 3-(3- 
amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. Alternatively, the 
anti-sense nucleic acid can be produced biologically using an expression vector into 
which a nucleic acid has been sub-cloned in an anti-sense orientation such that the 
transcribed RNA will be complementary to a target nucleic acid of interest. 

To introduce antisense or sense oligonucleotides into target cells (cells containing 
the target nucleic acid sequence), any gene transfer method may be used. Examples of 
gene transfer methods include (1) biological, such as gene transfer vectors like Epstein- 
Barr virus or conjugating the exogenous DNA to a ligand-binding molecule, (2) physical, 
such as electroporation and injection, and (3) chemical, such as CaP0 4 precipitation and 
oligonucleotide-lipid complexes. 

An antisense or sense oligonucleotide is inserted into a suitable gene transfer 
retroviral vector. A cell containing the target nucleic acid sequence is contacted with the 
recombinant retroviral vector, either in vivo or ex vivo. Examples of suitable retroviral 
vectors include those derived from the murine retrovirus M-MuLV, N2 (a retrovirus 
derived from M-MuLV), or the double copy vectors designated DCT5A, DCT5B and 
DCT5C (WO 90/13641, 1990). To achieve sufficient nucleic acid molecule 
transcription, vector constructs in which the transcription of the anti-sense nucleic acid 
molecule is controlled by a strong pol H or pol m promoter are preferred. 

To specify target cells in a mixed population of cells cell surface receptors that are 
specific to the target cells can be exploited. Antisense and sense oligonucleotides are 
conjugated to a ligand-binding molecule, as described in (WO 91/04753, 1991). Ligands 
are chosen for receptors that are specific to the target cells. Examples of suitable ligand- 
binding molecules include cell surface receptors, growth factors, cytokines, or other 
ligands that bind to cell surface receptors or molecules. Preferably, conjugation of the 
ligand-binding molecule does not substantially interfere with the ability of the receptors 
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or molecule to bind the ligand-binding molecule conjugate, or block entry of the sense or 
antisense oligonucleotide or its conjugated version into the cell. 

Liposomes efficiently transfer sense or an antisense oligonucleotide to cells (WO 
90/10448, 1990). The sense or antisenseoligonucleotide-lipid complex is preferably 
dissociated within the cell by an endogenous lipase. 

The anti-sense nucleic acid molecule of the invention may be an a-anomeric 
nucleic acid molecule. An a-anomeric nucleic acid molecule forms specific double- 
stranded hybrids with complementary RNA in which, contrary to the usual a-units. the 
strands run parallel to each other (Gautier et al, 1987). The anti-sense nucleic acid 
molecule can also comprise a 2'-o-methylribonucleotide (Inoue et al, 1987a) or a 
chimeric RNA-DNA analogue (Inoue et al, 1987b). 

In one embodiment, an anti-sense nucleic acid of the invention is a ribozyme. 
Ribozymes are catalytic RNA molecules with ribonuclease activity that are capable of 
cleaving a single-stranded nucleic acid, such as an mRNA, to which they have a 
complementary region. Thus, nbozymes, such as hammerhead ribozymes (Haseloff and 
Gerlach, 1988) can be used to catalytically cleave AAP mRNA transcripts and thus inhibit 
translation. A ribozyme specific for an AAP-encoding nucleic acid can be designed 
based on the nucleotide sequence of an AAP cDNA {i.e., SEQ ID NOS:l, 3, 5, 7, 9, 1 1, 13 
or 15). For example, a derivative of a Tetrahymena L-19 IVS RNA can be constructed m 
which the nucleotide sequence of the active site is complementary to the nucleotide 
sequence to be cleaved in an AAP-encoding mRNA (Cech et al, U.S. Patent No. 
5,1 16,742, 1992; Cech et al, U.S. Patent No. 4,987,071, 1991). An AAP mRNA can also 
be used to select a catalytic RNA having a specific ribonuclease activity from a pool of 
RNA molecules (Bartel and Szostak, 1993). 

Alternatively, AAP expression can be inhibited by targeting nucleotide sequences 
complementary to the regulatory region of an AAP {e.g., an AAP promoter and/or 
enhancers) to form triple helical structures that prevent transcription of an AAP in target 
cells (Helene, 1991; Helene et al, 1992; Maher, 1992). 

Modifications of antisense and sense oligonucleotides can augment their 
effectiveness. Modified sugar-phosphodiester bonds or other sugar linkages (WO 
91/06629 1991), increase in vivo stability by conferring resistance to endogenous 
nucleases without disrupting binding specificity to target sequences. Other modifications 
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can increase the affinities of the oligonucleotides for their targets, such as covalently 
linked organic moieties (WO 90/10448, 1990) or poly-(L)-lysine. Other attachments 
modify binding specificities of the oligonucleotides for their targets, including metal 
complexes or intercalating (e.g. ellipticine) and alkylating agents. 

For example, the deoxyribose phosphate backbone of the nucleic acids can be 
modified to generate peptide nucleic acids (Hyrup and Nielsen, 1996). "Peptide nucleic 
acids" or "PNAs" refer to nucleic acid mimics (e.g., DNA mimics) in that the deoxynbose 
phosphate backbone is replaced by a pseudopeptide backbone and only the four natural 
nucleobases are retained. The neutral backbone of PNAs allows for specfic 
hybridization to DNA and RNA under conditions of low ionic strength. The synthes 1S of 
PNA oligomers can be performed using standard solid phase peptide synthesis protocols 
(Hyrup and Nielsen, 1996; Perry-O'Keefe et al., 1996). 

PNAs of an AAP can be used in therapeutic and diagnostic applications. For 
example PNAs can be used as anti-sense or antigene agents for sequence-specific 
modulation of gene expression by inducing transection or translation arrest or minting 
replication. AAP PNAs may also be used in the analysis of single base pair mutations 
(e g PNA directed PCR clamping; as artificial restriction enzymes when used in 
combination with other enzymes, e.g., S, nucleases (Hyrup and Nielsen, 1996); or as 
probes or primers for DNA sequence and hybndization (Hyrup and Nielsen, 1996; Perry- 

O'Keefe et al., 1996). 

PNAs of an AAP can be modified to enhance their stability or cellular uptake, 
lipophilic or other helper groups may be attached to PNAs, PNA-DNA dimmers formed, 
or the use of liposomes or other drug delivery techniques. For example, PNA-DNA 
chimeras can be generated that may combine the advantageous properties of PNA and 
DNA Such chimeras allow DNA recognition enzymes (e.g., RNase H and DNA 
polymerases) to interact with the DNA portion while the PNA portion provides h lg h 
binding affinity and specificity. PNA-DNA chimeras can be linked using linkers of 
appropriate lengths selected in terms of base stacking, number of bonds between the 
nucleobases, and orientation (Hyrup and Nielsen, 1996). The synthesis of PNA-DNA 
chimeras can be performed (Finn et al., 1996; Hyrup and Nielsen, 1996). For example, a 
DNA chain can be synthesized on a solid support using standard phosphoram.dite 
coupling chemistry, and modified nucleoside analogs, e.g., 5--(4-methoxytrity.)amino-5-- 
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deoxy-thymidine phosphoramidite, can be used between the PNA and the 5' end of DNA 
(Finn et al., 1996; Hyrup and Nielsen, 1996). PNA monomers are then coupled in a 
stepwise manner to produce a chimeric molecule with a 5' PNA segment and a 3' DNA 
segment (Finn et al., 1996). Alternatively, chimeric molecules can be synthesized with a 
5' DNA segment and a 3' PNA segment (Petersen et al, 1976). 

The oligonucleotide may include other appended groups such as peptides {e.g., for 
targeting host cell receptors in vivo), or agents facilitating transport across the cell 
membrane (Lemaitre et al, 1987; Letsinger et al., 1989) or PCT Publication No. 
WO88/09810) or the blood-brain barrier (e.g., PCT Publication No. WO 89/10134). In 
addition, oligonucleotides can be modified with hybridization-triggered cleavage agents 
(van der Krol et al., 1988b) or intercalating agents (Zon, 1988). The oligonucleotide may 
be conjugated to another molecule, e.g., a peptide, a hybridization triggered cross-linking 
agent, a transport agent, a hybridization-triggered cleavage agent, and the like. 

AAP polypeptides 

One aspect of the invention pertains to isolated AAP, and biologically-active 
portions derivatives, fragments, analogs or homologs thereof. Also provided are 
polypeptide fragments suitable for use as immunogens to raise anti-AAP Abs. In one 
embodiment, a native AAP can be isolated from cells or tissue sources by an appropriate 
purification scheme using standard protein purification techniques. In another 
embodiment, AAP are produced by recombinant DNA techniques. Alternative to 
recombinant expression, an AAP can be synthesized chemically using standard peptide 

synthesis techniques. 

1 . Polypeptides 

An AAP polypeptide includes the amino acid sequence of an AAP whose 
sequences are provided in SEQ ID NOS:2, 4, 6, 8, 10, 12, 14 or 16. The invention also 
includes a mutant or variant protein any of whose residues may be changed from the 
corresponding residues shown in SEQ ID NOS:2, 4, 6, 8, 10, 12, 14 or 16, while still 
encoding a protein that maintains its AAP activities and physiological functions, or a 
functional fragment thereof. 

2. Variant AAP polypeptides 
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computer software such as BLAST, BLAST2, ALIGN2 or Megalign (DNASTAR) 
software is used to align peptide sequences. Those skilled in the art can determine 
appropriate parameters for measuring alignment, including any algorithms needed to 
achieve maximal alignment over the full length of the sequences being compared. 

When amino acid sequences are aligned, the % amino acid sequence identity of a 
given amino acid sequence A to, with, or against a given amino acid sequence B (which 
can alternatively be phrased as a given amino acid sequence A that has or comprises a 
certain % amino acid sequence identity to, with, or against a given amino acid sequence 
B) can be calculated as: 

%amino acid sequence identity = X/Y ' 100 
where 

X is the number of amino acid residues scored as identical matches by the 
sequence alignment program's or algorithm's alignment of A and B 
and 

Y is the total number of amino acid residues in B. 

If the length of amino acid sequence A is not equal to the length of amino acid 
sequence B, the % amino acid sequence identity of A to B will not equal the % amino 
acid sequence identity of B to A. 

3. Isolated/purified polypeptides 

An "isolated" or "purified" polypeptide, protein or biologically active fragment is 
separated and/or recovered from a component of its natural environment. Contaminant 
components include materials that would typically interfere with diagnostic or therapeutic 
uses for the polypeptide, and may include enzymes, hormones, and other proteinaceous or 
non-proteinaceous materials. Preferably, the polypeptide is purified to a sufficient degree 
to obtain at least 15 residues of N-terminal or internal amino acid sequence. To be 
substantially isolated, preparations having less than 30% by dry weight of non-AAP 
contaminating material (contaminants), more preferably less than 20%, 10% and most 
preferably less than 5% contaminants. An isolated, recombinantly-produced AAP or 
biologically active portion is preferably substantially free of culture medium, i.e., culture 
medium represents less than 20%, more preferably less than about 10%, and most 
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preferably less than about 5% of the volume of the AAP preparation. Examples of 
contaminants include cell debris, culture media, and substances used and produced during 
in vitro synthesis of an AAP. 

4. Biologically active 

Biologically active portions of an AAP include peptides comprising amino acid 
sequences sufficiently homologous to or derived from the amino acid sequences of an 
AAP (SEQ ID NOS:2, 4, 6, 8, 10, 12, 14 or 16) that include fewer amino acids than a 
full-length AAP, and exhibit at least one activity of an AAP. Biologically active portions 
comprise a domain or motif with at least one activity of a native AAP. A biologically 
active portion of an AAP can be a polypeptide that is, for example, 10, 25, 50, 100 or 
more amino acid residues in length. Other biologically active portions, in which other 
regions of the protein are deleted, can be prepared by recombinant techniques and 
evaluated for one or more of the functional activities of a native AAP. 

Biologically active portions of an AAP may have an amino acid sequence shown 
in SEQ ID NOS:2, 4, 6, 8, 10, 12, 14 or 16, or substantially homologous to SEQ ID 
NOS:2, 4, 6, 8, 10, 12, 14 or 16, and retains the functional activity of the protein of SEQ 
ID NOS:2, 4, 6, 8, 10, 12, 14 or 16, yet differs in amino acid sequence due to natural 
allelic variation or mutagenesis. Other biologically active AAP may comprise an amino 
acid sequence at least 45% homologous to the amino acid sequence of SEQ ID NOS:2, 4, 
6, 8, 10, 12, 14 or 16, and retains the functional activity of native AAP. 

5. Determining homology between two or more sequences 

"AAP variant" means an active AAP having at least: (1) about 80% amino acid 
sequence identity with a full-length native sequence AAP sequence, (2) an AAP sequence 
lacking the signal peptide, (3) an extracellular domain of an AAP, with or without the 
signal peptide, or (4) any other fragment of a full-length AAP sequence. For example, 
AAP variants include an AAP wherein one or more amino acid residues are added or 
deleted at the N- or C- terminus of the full-length native amino acid sequence. An AAP 
variant will have at least about 80% amino acid sequence identity, preferably at least 
about 81% amino acid sequence identity, more preferably at least about 82%, 83%, 84%, 
85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% amino 
acid sequence identity and most preferably at least about 99% amino acid sequence 
identity with a full-length native sequence AAP sequence. An AAP variant may have a 
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sequence lacking the signal peptide, an extracellular domain of an AAP, with or without 
the signal peptide, or any other fragment of a full-length AAP sequence. Ordinarily, AAP 
variant polypeptides are at least about 10 amino acids in length, often at least about 20 
amino acids in length, more often at least about 30, 40, 50, 60, 70, 80, 90, 100, 150, 200, 
or 300 amino acids in length, or more. 

"Percent (%) amino acid sequence identity" is defined as the percentage of amino 
acid residues that are identical with amino acid residues in a disclosed AAP sequence in a 
candidate sequence when the two sequences are aligned To determine % amino acid 
identity, sequences are aligned and if necessary, gaps are introduced to achieve the 
maximum % sequence identity; conservative substitutions are not considered as part of 
the sequence identity. Amino acid sequence alignment procedures to determine percent 
identity are well known to those of skill in the art. Often publicly available computer 
software such as BLAST, BLAST2, ALIGN2 or Megalign (DNASTAR) software is used 
to align peptide sequences. Those skilled in the art can determine appropriate parameters 
for measuring alignment, including any algorithms needed to achieve maximal alignment 
over the full length of the sequences being compared. 

When amino acid sequences are aligned, the % amino acid sequence identity of a 
given amino acid sequence A to, with, or against a given amino acid sequence B (which 
can alternatively be phrased as a given amino acid sequence A that has or comprises a 
certain % amino acid sequence identity to, with, or against a given amino acid sequence 
B) can be calculated as: 

%amino acid sequence identity = X/Y * 100 
where 

X is the number of amino acid residues scored as identical matches by the 
sequence alignment program's or algorithm's alignment of A and B 
and 

Y is the total number of amino acid residues in B. 

If the length of amino acid sequence A is not equal to the length of amino acid 
sequence B, the % amino acid sequence identity of A to B will not equal the % amino 
acid sequence identity of B to A. 
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6. Chimeric and fusion proteins 

Fusion polypeptides are useful in expression studies, cell-localization, bioassays, 
and AAP purification. An AAP "chimeric protein" or "fusion protein" comprises an AAP 
fused to a non-AAP polypeptide. A non-AAP polypeptide is not substantially 
homologous to an AAP (SEQ ID NOS:2, 4, 6, 8, 10, 12, 14 or 16). An AAP fusion 
protein may include any portion to an entire AAP, including any number of the 
biologically active portions. An AAP may be fused to the C-terminus of the GST 
(glutathione S -transferase) sequences. Such fusion proteins facilitate the purification of a 
recombinant AAP. In certain host cells, {e.g. mammalian), heterologous signal 
sequences fusions may ameliorate AAP expression and/or secretion. Additional 
exemplary fusions are presented in Table C. 

Other fusion partners can adapt an AAP therapeutically. Fusions with members of 
the immunoglobulin (Ig) protein family are useful in therapies that inhibit an AAP ligand 
or substrate interactions, consequently suppressing an AAP-mediated signal transduction 
in vivo. Such fusions, incorporated into pharmaceutical compositions, may be used to 
treat proliferative and differentiation disorders, as well as modulating cell survival. An 
AAP-Ig fusion polypeptides can also be used as immunogens to produce an anti-AAP 
Abs in a subject, to purify AAP ligands, and to screen for molecules that inhibit 
interactions of an AAP with other molecules. 

Fusion proteins can be easily created using recombinant methods. A nucleic acid 
encoding an AAP can be fused in-frame with a non-AAP encoding nucleic acid, to an 
AAP NH 2 - or COO- -terminus, or internally. Fusion genes may also be synthesized by 
conventional techniques, including automated DNA synthesizers. PCR amplification 
using anchor primers that give rise to complementary overhangs between two consecutive 
gene fragments that can subsequently be annealed and reamplified to generate a chimeric 
gene sequence (Ausubel et al, 1987) is also useful. Many vectors are commercially 
available that facilitate sub-cloning an AAP in-frame to a fusion moiety. 



Table C 



Useful non-AAP fusion polypeptides 



Reporter 



in vitro 



in vivo 



Notes 



Reference 



-79- 



Human growth 
hormone (hGH) 


Radioimmuno- 
assay 


none 


Expensive, 
insensitive, 
narrow linear 
range. 


^oeiuen et ai. y 
1986) 


P-glucu- 
ronidase (GUS) 


Colorimetric, 
fluorescent, or 
chemi- 
luminescent 


colorimetric 
(histo-chemical 
staining wun js.- 
glue) 


sensitive, 
broad linear 
range, non- 
iostopic. 


(Gallagher, 
1992) 


Green 
fluorescent 
protein (GFP) 
and related 
molecules (RFP, 
BFP, AAP, etc.) 


Fluorescent 


fluorescent 


can be used in 
live cells; 
resists photo- 
bleaching 


(Chalfie et al, 
1994) 


Luciferase 
(firefly) 


bioluminsecent 


Bio- 
luminescent 


protein is 
unstable, 
difficult to 
reproduce, 
signal is brief 


(de Wet et al, 
1987) 


Chloramphenico 
al 

acetyltransferas 
e (CAT) 


Chromato- 
graphy, 
differential 
extraction, 
fluorescent, or 
immunoassay 


none 


Expensive 
radioactive 
substrates, 
time- 
consuming, 
insensitive, 
narrow linear 
range 


(Gorman et al. , 
1982) 


P-galacto-sidase 


colorimetric, 
fluorescence, 
chemi- 
luminscence 


colorimetric 
(histochemical 
staining with X- 
gal), bio- 
luminescent in 
live cells 


sensitive, 
broad linear 
range; some 
cells have high 
endogenous 
activity 


(Alam and 
Cook, 1990) 


Secrete alkaline 

phosphatase 

(SEAP) 


colorimetric, 
bioluminescent, 
chemi- 
lurnincsceni 


none 


Chem- 
iluminscence 
assay is 
sensitive dnu 
broad linear 
range; some 
cells have 
endogenouse 
alkaline 
phosphatase 
activity 


(Berger et al., 
1988) 



Therapeutic applications of AAP 
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1. Agonists and antagonists 

"Antagonist" includes any molecule that partially or fully blocks, inhibits, or 
neutralizes a biological activity of an endogenous AAP. Similarly, "agonist" includes any 
molecule that mimics a biological activity of an endogenous AAP. Molecules that can act 
as agonists or antagonists include Abs or antibody fragments, fragments or variants of an 
endogenous AAP, peptides, antisense oligonucleotides, small organic molecules, etc. 

2. Identifying antagonists and agonists 

To assay for antagonists, an AAP is added to, or expressed in, a cell along with 
the compound to be screened for a particular activity. If the compound inhibits the 
activity of interest in the presence of an AAP, that compound is an antagonist to the AAP; 
if an AAP activity is enhanced, the compound is an agonist. 

(a) Specific examples of potential antagonists and agonist 
Any molecule that alters AAP cellular effects is a candidate antagonist or agonist. 
Screening techniques well known to those skilled in the art can identify these molecules. 
Examples of antagonists and agonists include: (1) small organic and inorganic 
compounds, (2) small peptides, (3) Abs and derivatives, (4) polypeptides closely related 
to an AAP, (5) antisense DNA and RNA, (6) ribozymes, (7) triple DNA helices and (8) 
nucleic acid aptamers. 

Small molecules that bind to an AAP active site or other relevant part of the 
polypeptide and inhibit the biological activity of the AAP are antagonists. Examples of 
small molecule antagonists include small peptides, peptide-like molecules, preferably 
soluble, and synthetic non-peptidyl organic or inorganic compounds. These same 
molecules, if they enhance an AAP activity, are examples of agonists. 

Almost any antibody that affects an AAP's function is a candidate antagonist, and 
occasionally, agonist. Examples of antibody antagonists include polyclonal, monoclonal, 
single-chain, anti-idiotypic, chimeric Abs, or humanized versions of such Abs or 
fragments. Abs may be from any species in which an immune response can be raised 
Humanized Abs are also contemplated. 

Alternatively, a potential antagonist or agonist may be a closely related protein, 
for example, a mutated form of an AAP that recognizes an AAP-interacting protein but 
imparts no effect, thereby competitively inhibiting AAP action. Alternatively, a mutated 
AAP may be constitutively activated and may act as an agonist. 
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Antisense RNA or DNA constructs can be effective antagonists. Antisense RNA 
or DNA molecules block function by inhibiting translation by hybridizing to targeted 
mRNA. Antisense technology can be used to control gene expression through triple-helix 
formation or antisense DNA or RNA, both of which depend on polynucleotide binding to 
DNA or RNA. For example, the 5' coding portion of an AAP sequence is used to design 
an antisense RNA oligonucleotide of from about 10 to 40 base pairs in length. A DNA 
oligonucleotide is designed to be complementary to a region of the gene involved in 
transcription (triple helix) (Beal and Dervan, 1991; Cooney et ah, 1988; Lee et aL, 1979), 
thereby preventing transcription and the production of the AAP. The antisense RNA 
oligonucleotide hybridizes to the mRNA in vivo and blocks translation of the mRNA 
molecule into the AAP (antisense) (Cohen, 1989; Okano et aL, 1991). These 
oligonucleotides can also be delivered to cells such that the antisense RNA or DNA may 
be expressed in vivo to inhibit production of the AAP. When antisense DNA is used, 
oligodeoxyribonucleotides derived from the translation-initiation site, e.g., between about 
-10 and +10 positions of the target gene nucleotide sequence, are preferred. 

Ribozymes are enzymatic RNA molecules capable of catalyzing the specific 
cleavage of RNA. Ribozymes act by sequence-specific hybridization to the 
complementary target RNA, followed by endonucleolytic cleavage. Specific ribozyme 
cleavage sites within a potential RNA target can be identified by known techniques (WO 
97/33551, 1997; Rossi, 1994). 

To inhibit transcription, triple-helix nucleic acids that are single-stranded and 
comprise deoxynucleotides are useful antagonists. These oligonucleotides are designed 
such that triple-helix formation via Hoogsteen base-pairing rules is promoted, generally 
requiring stretches of purines or pyrimidines (WO 97/33551, 1997). 

Aptamers are short oligonucleotide sequences that can be used to recognize and 
specifically bind almost any molecule. The systematic evolution of ligands by 
exponential enrichment (SELEX) process (Ausubel et aL, 1987; Ellington and Szostak, 
1990; Tuerk and Gold, 1990) is powerful and can be used to find such aptamers. 
Aptamers have many diagnostic and clinical uses; almost any use in which an antibody 
has been used clinically or diagnostically, aptamers too may be used. In addition, they 
are cheaper to make once they have been identified, and can be easily applied in a variety 
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of formats, including administration in pharmaceutical compositions, in bioassays, and 
diagnostic tests (Jayasena, 1999). 

Anti-AAP Abs 

The invention encompasses Abs and antibody fragments, such as F ab or (F a b)2,that 
bind immunospecifically to any AAP epitopes. 

"Antibody" (Ab) comprises single Abs directed against an AAP (anti-AAP Ab; 
including agonist, antagonist, and neutralizing Abs), anti-AAP Ab compositions with 
poly-epitope specificity, single chain anti-AAP Abs, and fragments of anti-AAP Abs. A 
"monoclonal antibody" is obtained from a population of substantially homogeneous Abs, 
i.e.j the individual Abs comprising the population are identical except for possible 
naturally-occurring mutations that may be present in minor amounts. Exemplary Abs 
include polyclonal (pAb), monoclonal (mAb), humanized, bi-specific (bsAb), and 
heteroconjugate Abs. 

1 . Polyclonal Abs (pAbs) 

Polyclonal Abs can be raised in a mammalian host, for example, by one or more 
injections of an immunogen and, if desired, an adjuvant. Typically, the immunogen 
and/or adjuvant are injected in the mammal by multiple subcutaneous or intraperitoneal 
injections. The immunogen may include an AAP or a fusion protein. Examples of 
adjuvants include Freund's complete and monophosphoryl Lipid A synthetic-trehalose 
dicorynomycolate (MPL-TDM). To improve the immune response, an immunogen may 
be conjugated to a protein that is immunogenic in the host, such as keyhole limpet 
hemocyanin (KLH), serum albumin, bovine thyroglobulin, and soybean trypsin inhibitor. 
Protocols for antibody production are described by (Ausubel et al., 1987; Harlow and 
Lane, 1988). Alternatively, pAbs may be made in chickens, producing IgY molecules 
(Schade etal, 1996). 

2. Monoclonal Abs ( mAbs) 

Anti-AAP mAbs may be prepared using hybridoma methods (Milstein and Cuello, 
1983). Hybridoma methods comprise at least four steps: (1) immunizing a host, or 
lymphocytes from a host; (2) harvesting the mAb secreting (or potentially secreting) 
lymphocytes, (3) fusing the lymphocytes to immortalized cells, and (4) selecting those 
cells that secrete the desired (anti-AAP) mAb. 
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A mouse, rat, guinea pig, hamster, or other appropriate host is immunized to elicit 
lymphocytes that produce or are capable of producing Abs that will specifically bind to 
the immunogen. Alternatively, the lymphocytes may be immunized in vitro. If human 
cells are desired, peripheral blood lymphocytes (PBLs) are generally used; however, 
spleen cells or lymphocytes from other mammalian sources are preferred. The 
immunogen typically includes an AAP or a fusion protein. 

The lymphocytes are then fused with an immortalized cell line to form hybridoma 
cells, facilitated by a fusing agent such as polyethylene glycol (Goding, 1996). Rodent, 
bovine, or human myeloma cells immortalized by transformation may be used, or rat or 
mouse myeloma cell lines. Because pure populations of hybridoma cells and not unfused 
immortalized cells are preferred, the cells after fusion are grown in a suitable medium that 
contains one or more substances that inhibit the growth or survival of unfused, 
immortalized cells. A common technique uses parental cells that lack the enzyme 
hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT). In this case, 
hypoxanthine, aminopterin and thymidine are added to the medium (HAT medium) to 
prevent the growth of HGPRT-deficient cells while permitting hybridomas to grow. 

Preferred immortalized cells fuse efficiently, can be isolated from mixed 
populations by selecting in a medium such as HAT, and support stable and high-level 
expression of antibody after fusion. Preferred immortalized cell lines are murine 
myeloma lines, available from the American Type Culture Collection (Manassas, VA). 
Human myeloma and mouse-human heteromyeloma cell lines also have been described 
for the production of human mAbs (Kozbor et a/., 1984; Schook, 1987). 

Because hybridoma cells secrete antibody extracellularly, the culture media can be 
assayed for the presence of mAbs directed against an AAP (anti-AAP mAbs). 
Immunoprecipitation or in vitro binding assays, such as radio immunoassay (RIA) or 
enzyme-linked immunoabsorbent assay (ELISA), measure the binding specificity of 
mAbs (Harlow and Lane, 1988; Harlow and Lane, 1999), including Scatchard analysis 
(Munson and Rodbard, 1980). 

Anti-AAP mAb secreting hybridoma cells may be isolated as single clones by 
limiting dilution procedures and sub-cultured (Goding, 1996). Suitable culture media 
include Dulbecco's Modified Eagle's Medium, RPMI-1640, or if desired, a protein-free 
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or -reduced or serum-free medium (e.g., Ultra DOMA PF or HL-1; Biowhittaker; 
Walkersville, MD). The hybridoma cells may also be grown in vivo as ascites. 

The mAbs may be isolated or purified from the culture medium or ascites fluid by 
conventional Ig purification procedures such as protein A-Sepharose, hydroxylapatite 
chromatography, gel electrophoresis, dialysis, ammonium sulfate precipitation or affinity 
chromatography (Harlow and Lane, 1988; Harlow and Lane, 1999). 

The mAbs may also be made by recombinant methods (U.S. Patent No. 4166452, 
1979). DNA encoding anti-AAP mAbs can be readily isolated and sequenced using 
conventional procedures, e.g., using oligonucleotide probes that specifically bind to 
murine heavy and light antibody chain genes, to probe preferably DNA isolated from 
anti-AAP-secreting mAb hybridoma cell lines. Once isolated, the isolated DNA 
fragments are sub-cloned into expression vectors that are then transfected into host cells 
such as simian COS-7 cells, Chinese hamster ovary (CHO) cells, or myeloma cells that do 
not otherwise produce Ig protein, to express mAbs. The isolated DNA fragments can be 
modified, for example, by substituting the coding sequence for human heavy and light 
chain constant domains in place of the homologous murine sequences (U.S. Patent No. 
4816567, 1989; Morrison et aL, 1987), or by fusing the Ig coding sequence to all or part 
of the coding sequence for a non-Ig polypeptide. Such a non-Ig polypeptide can be 
substituted for the constant domains of an antibody, or can be substituted for the variable 
domains of one antigen-combining site to create a chimeric bivalent antibody. 

3 . Monovalent Abs 

The Abs may be monovalent Abs that consequently do not cross-link with each 
other. For example, one method involves recombinant expression of Ig light chain and 
modified heavy chain. Heavy chain truncations generally at any point in the F c region 
will prevent heavy chain cross-linking. Alternatively, the relevant cysteine residues are 
substituted with another amino acid residue or are deleted, preventing crosslinking. In 
vitro methods are also suitable for preparing monovalent Abs. Abs can be digested to 
produce fragments, such as F ab fragments (Harlow and Lane, 1988; Harlow and Lane, 
1999). 

4. Humanized and human Abs 

Anti-AAP Abs may further comprise humanized or human Abs. Humanized 
forms of non-human Abs are chimeric Igs, Ig chains or fragments (such as F v , F a t» F a bs 
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F ( ab-)2 or other antigen-binding subsequences of Abs) that contain minimal sequence 
derived from non-human Ig. 

Generally, a humanized antibody has one or more amino acid residues introduced 
from a non-human source. These non-human amino acid residues are often referred to as 
"import" residues, which are typically taken from an "import" variable domain. 
Humanization is accomplished by substituting rodent CDRs or CDR sequences for the 
corresponding sequences of a human antibody (Jones et al, 1986; Riechmann et a/., 
1988; Verhoeyen et al, 1988). Such "humanized" Abs are chimeric Abs (U.S. Patent No. 
4816567, 1989), wherein substantially less than an intact human variable domain has 
been substituted by the corresponding sequence from a non-human species. In practice, 
humanized Abs are typically human Abs in which some CDR residues and possibly some 
FR residues are substituted by residues from analogous sites in rodent Abs. Humanized 
Abs include human Igs (recipient antibody) in which residues from a complementary 
determining region (CDR) of the recipient are replaced by residues from a CDR of a non- 
human species (donor antibody) such as mouse, rat or rabbit, having the desired 
specificity, affinity and capacity. In some instances, corresponding non-human residues 
replace F v framework residues of the human Ig. Humanized Abs may comprise residues 
that are found neither in the recipient antibody nor in the imported CDR or framework 
sequences. In general, the humanized antibody comprises substantially all of at least one, 
and typically two, variable domains, in which most if not all of the CDR regions 
correspond to those of a non-human Ig and most if not all of the FR regions are those of a 
human Ig consensus sequence. The humanized antibody optimally also comprises at least 
a portion of an Ig constant region (F c ), typically that of a human Ig (Jones et a/., 1986; 
Presta, 1992; Riechmann etai, 1988). 

Human Abs can also be produced using various techniques, including phage 
display libraries (Hoogenboom et al, 1991; Marks et al, 1991) and the preparation of 
human mAbs (Boerner et aL, 1991; Reisfeld and Sell, 1985). Similarly, introducing 
human Ig genes into transgenic animals in which the endogenous Ig genes have been 
partially or completely inactivated can be exploited to synthesize human Abs. Upon 
challenge, human antibody production is observed, which closely resembles that seen in 
humans in all respects, including gene rearrangement, assembly, and antibody repertoire 
(U.S. Patent No. 5545807, 1996; U.S. Patent No. 5545806, 1996; U.S. Patent No. 
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5569825, 1996; U.S. Patent No. 5633425, 1997; U.S. Patent No. 5661016 1997- U S 
Patent No. 5625126, 1997; Fishw.ld * al, 1996; Lonberg and Huszar, 1995; Lonberg et 
al., 1994; Marks et al., 1992). 

5. Bi-specific mAbs 

Bi-specific Abs are monoclonal, preferably human or humanized, that have 
binding specificities for at least two different antigens. For example, a binding specificity 
is an AAP; the other is for any antigen of choice, preferably a cell-surface protein or 
receptor or receptor subunit. 

Traditionally, the recombinant production of bi-specific Abs is based on the co- 
expression of two Ig heavy-chain/light-chain pairs, where the two heavy chains have 
different specificities (Milstein and Cuello, 1983). Because of the random assortment of 
Ig heavy and light chains, the resulting hybridomas (quadromas) produce a potential 
mixture of ten different antibody molecules, of which only one has the desired bi-spec,fic 
structure. The desired antibody can be purified using affinity chromatography or other 
techniques (WO 93/08829, 1993; Traunecker et al. , 1991). 

To manufacture a bi-specific antibody (Suresh et al., 1986), variable domains with 
the desired antibody-antigen combining sites are fused to Ig constant doma,n sequences 
The fusion is preferably with an Ig heavy-chain constant domain, comprising at least part 
of the hinge, CH2, and CH3 regions. Preferably, the first heavy-chain constant region 
(CHI) containing the site necessary for light-chain binding is in at least one of the 
fusions. DNAs encoding the Ig heavy-chain fusions and, if desired, the Ig light chain are 
inserted into separate expression vectors and are co-transfected into a suitable host 
organism. 

The interface between a pair of antibody molecules can be engineered to 
maximize the percentage of heterodimers that are recovered from recombinant cell 
culture (WO 96/2701 1, 1996). The prefen-ed interface comprises at least part of the CH3 
region of an antibody constant domain. In this method,.one or more small amino acid 
s.de chains from the interface of the first antibody molecule are replaced with larger side 
chains (e.g. tyrosine or tryptophan). Compensatory "cavities" of identical or similar size 
to the large side chain(s) are created on the interface of the second antibody molecule by 
replacing large amino acid side chains with smaller ones (e.g. alanine or threonine) This 
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mechanism increases the yield of the heterodimer over unwanted end products such as 
homodimers. 

Bi-specific Abs can be prepared as full length Abs or antibody fragments (e.g. 
F (ab )2 bi-specific Abs). One technique to generate bi-specific Abs exploits chemical 
linkage. Intact Abs can be proteolytically cleaved to generate F (ab )2 fragments (Brennan 
et al, 1985). Fragments are reduced with a dithiol complexing agent, such as sodium 
arsenite, to stabilize vicinal dithiols and prevent intermolecular disulfide formation. The 
generated F ab - fragments are then converted to thionitrobenzoate (TNB) derivatives. One 
of the F a b-TNB derivatives is then reconverted to the F ab -thiol by reduction with 
mercaptoethylamine and is mixed with an equimolar amount of the other F ab -TNB 
derivative to form the bi-specific antibody. The produced bi-specific Abs can be used as 
agents for the selective immobilization of enzymes. 

F ab - fragments may be directly recovered from E. coli and chemically coupled to 
form bi-specific Abs. For example, fully humanized bi-specific F (ab )2 Abs can be 
produced (Shalaby et al, 1992). Each F ab > fragment is separately secreted from E. coli 
and directly coupled chemically in vitro, forming the bi-specific antibody. 

Various techniques for making and isolating bi-specific antibody fragments 
directly from recombinant cell culture have also been described. For example, leucine 
zipper motifs can be exploited (Kostelny et al, 1992). Peptides from the Fos and Jun 
proteins are linked to the F ab - portions of two different Abs by gene fusion. The antibody 
homodimers are reduced at the hinge region to form monomers and then re-oxidized to 
form antibody heterodimers. This method can also produce antibody homodimers. The 
"diabody" technology (Holliger et al, 1993) provides an alternative method to generate 
bi-specific antibody fragments. The fragments comprise a heavy-chain variable domain 
(V H ) connected to a light-chain variable domain (V L ) by a linker that is too short to allow 
pairing between the two domains on the same chain. The V H and V L domains of one 
fragment are forced to pair with the complementary V L and V H domains of another 
fragment, forming two antigen-binding sites. Another strategy for making bi-specific 
antibody fragments is the use of single-chain F v (sF v ) dimers (Gruber et al, 1994). Abs 
with more than two valencies are also contemplated, such as tri-specific Abs (Tutt et al, 
1991). 
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Exemplary bi-specific Abs may bind to two different epitopes on a given AAP. 
Alternatively, cellular defense mechanisms can be restricted to a particular cell expressing 
the particular AAP: an anti-AAP arm may be combined with an arm that binds to a 
leukocyte triggering molecule, such as a T-cell receptor molecule (e.g. CD2, CD3, CD28, 
or B7), or to F c receptors for IgG (F c yR), such as F c yRI (CD64), F c yRII (CD32) and 
F c yRin (CD 16). Bi-specific Abs may also be used to target cytotoxic agents to cells that 
express a particular AAP. These Abs possess an AAP-binding arm and an arm that binds 
a cytotoxic agent or a radionuclide chelator. 

6. Heteroconjugate Abs 

Heteroconjugate Abs, consisting of two covalently joined Abs, have been 
proposed to target immune system cells to unwanted cells (4,676,980, 1987) and for 
treatment of human immunodeficiency virus (HIV) infection (WO 91/00360, 1991; WO 
92/20373, 1992). Abs prepared in vitro using synthetic protein chemistry methods, 
including those involving cross-linking agents, are contemplated. For example, 
immunotoxins may be constructed using a disulfide exchange reaction or by forming a 
thioether bond. Examples of suitable reagents include iminothiolate and methyl-4- 
mercaptobutyrimidate (4,676,980, 1987). 

7 . Immunocon jugate s 

Immunoconjugates may comprise an antibody conjugated to a cytotoxic agent 
such as a chemotherapeutic agent, toxin (e.g., an enzymatically active toxin or fragment 
of bacterial, fungal, plant, or animal origin), or a radioactive isotope (i.e., a - 
radioconjugate). 

Useful enzymatically-active toxins and fragments include Diphtheria A chain, 
non-binding active fragments of Diphtheria toxin, exotoxin A chain from Pseudomonas 
aeruginosa, ricin A chain, abrin A chain, modeccin A chain, a-sarcin, Aleurites fordii 
proteins, Dianthin proteins, Phytolaca americana proteins, Momordica charantia 
inhibitor, curcin, crotin, Sapaonaria officinalis inhibitor, gelonin, mitogellin, restrictocin, 
phenomycin, enomycin, and the tricothecenes. A variety of radionuclides are available 
for the production of radioconjugated Abs, such as 212 Bi, 131 I, 13l In, 90 Y, and l86 Re. 

Conjugates of the antibody and cytotoxic agent are made using a variety of bi- 
functional protein-coupling agents, such as N-succinimidyl-3-(2-pyridyldithiol) 
propionate (SPDP), iminothiolane (IT), bi-functional derivatives of imidoesters (such as 
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dimethyl adipimidate HC1), active esters (such as disuccinimidyl suberate), aldehydes 
(such as glutareldehyde), bis-azido compounds (such as bis (p-azidobenzoyl) 
hexanediamine), Ws-diazonium derivatives (such as bis-(p-diazoniumbenzoyl)- 
ethylenediamine), diisocyanates (such as tolyene 2,6- diisocyanate), and Ws-active 
fluorine compounds (such as l,5-difluoro-2,4-dinitrobenzene). For example, a ricin 
immunotoxin can be prepared (Vitetta et a/., 1987). !4 C-labeled 1-isothiocyanatobenzyl- 
3-methyldiethylene triaminepentaacetic acid (MX-DTPA) is an exemplary chelating 
agent for conjugating radionuclide to antibody (WO 94/1 1026, 1994). 

In another embodiment, the antibody may be conjugated to a "receptor" (such as 
streptavidin) for utilization in tumor pre-targeting wherein the antibody-receptor 
conjugate is administered to the patient, followed by removal of unbound conjugate from 
the circulation using a clearing agent and then administration of a streptavidin "ligand" 
{e.g., biotin) that is conjugated to a cytotoxic agent {e.g., a radionuclide). 

8. Effector function engineering 

The antibody can be modified to enhance its effectiveness in treating a disease, 
such as cancer. For example, cysteine residue(s) may be introduced into the F c region, 
thereby allowing interchain disulfide bond formation in this region. Such homodimeric 
Abs may have improved internalization capability and/or increased complement-mediated 
cell killing and antibody-dependent cellular cytotoxicity (ADCC) (Caron et aL, 1992; 
Shopes, 1992). Homodimeric Abs with enhanced anti-tumor activity can be prepared 
using hetero-bifunctional cross-linkers (Wolff et al, 1993). Alternatively, an antibody 
engineered with dual F c regions may have enhanced complement lysis (Stevenson et aL, 
1989). 

9. Immunoliposomes 

Liposomes containing the antibody may also be formulated (U.S. Patent No. 
4485045, 1984; U.S. Patent No. 4544545, 1985; U.S. Patent No. 5013556, 1991; Eppstein 
et al., 1985; Hwang et aL, 1980). Useful liposomes can be generated by a reverse-phase 
evaporation method with a lipid composition comprising phosphatidylcholine, 
cholesterol, and PEG-derivatized phosphatidylethanolamine (PEG- PE). Such 
preparations are extruded through filters of defined pore size to yield liposomes with a 
desired diameter. F a t>' fragments of the antibody can be conjugated to the liposomes 
(Martin and Papahadjopoulos, 1982) via a disulfide-interchange reaction. A 
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chemotherapeutic agent, such as Doxorubicin, may also be contained in the liposome 
(Gabizon et al. y 1989). Other useful liposomes with different compositions are 
contemplated. 

10. Diagnostic applications of Abs directed against an AAP 

Anti-AAP Abs can be used to localize and/or quantitate an AAP (e.g., for use in 
measuring levels of an AAP within tissue samples or for use in diagnostic methods, etc). 
Anti-AAP epitope Abs can be utilized as pharmacologically-active compounds. 

Anti-AAP Abs can be used to isolate an AAP by standard techniques, such as 
immunoaffinity chromatography or immunoprecipitation. These approaches facilitate 
purifying an endogenous AAP antigen-containing polypeptides from cells and tissues. 
These approaches, as well as others, can be used to detect an AAP in a sample to evaluate 
the abundance and pattern of expression of the antigenic protein. Anti-AAP Abs can be 
used to monitor protein levels in tissues as part of a clinical testing procedure; for 
example, to determine the efficacy of a given treatment regimen. Coupling the antibody 
to a detectable substance (label) allows detection of Ab-antigen complexes. Classes of 
labels include fluorescent, luminescent, bioluminescent, and radioactive materials, 
enzymes and prosthetic groups. Useful labels include horseradish peroxidase, alkaline 
phosphatase, P-galactosidase, acetylcholinesterase, streptavidin/biotin, avidin/biotin, 
umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, dichlorotriazinylamine 
fluorescein, dansyl chloride, phycoerythrin, luminol, luciferase, luciferin, aequorin, and 
,2 V 31 I, 35 Sor 3 H. 

1 1 . Antibody therapeutics 

Abs of the invention, including polyclonal, monoclonal, humanized and fully 
human Abs, can be used therapeutically. Such agents will generally be employed to treat 
or prevent a disease or pathology in a subject. An antibody preparation, preferably one 
having high antigen specificity and affinity generally mediates an effect by binding the 
target epitope(s). Generally, administration of such Abs may mediate one of two effects: 
(1) the antibody may prevent ligand binding, eliminating endogenous ligand binding and 
subsequent signal transduction, or (2) the antibody elicits a physiological result by 
binding an effector site on the target molecule, initiating signal transduction. 

A therapeutically effective amount of an antibody relates generally to the amount 
needed to achieve a therapeutic objective, epitope binding affinity, administration rate, 
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and depletion rate of the antibody from a subject. Common ranges for therapeutically 
effective doses may be, as a nonlimiting example, from about 0.1 mg/kg body weight to 
about 50 mg/kg body weight. Dosing frequencies may range, for example, from twice 
daily to once a week. 

12. Pharmaceutical compositions ofAbs 

Anti-AAP Abs, as well as other AAP interacting molecules (such as aptamers) 
identified in other assays, can be administered in pharmaceutical compositions to treat 
various disorders. Principles and considerations involved in preparing such compositions, 
as well as guidance in the choice of components can be found in (de Boer, 1994; 
Gennaro, 2000; Lee, 1990). 

Since some AAP are intracellular, Abs that are internalized are preferred used 
when whole Abs are used as inhibitors. Liposomes may also be used as a delivery 
vehicle for intracellular introduction. Where antibody fragments are used, the smallest 
inhibitory fragment that specifically binds to the epitope is preferred For example, 
peptide molecules can be designed that bind a preferred epitope based on the variable- 
region sequences of a useful antibody. Such peptides can be synthesized chemically 
and/or produced by recombinant DNA technology (Marasco et aL, 1993). Formulations 
may also contain more than one active compound for a particular treatment, preferably 
those with activities that do not adversely affect each other. The composition may 
comprise an agent that enhances function, such as a cytotoxic agent, cytokine, 
chemotherapeutic agent, or growth-inhibitory agent. 

The active ingredients can also be entrapped in microcapsules prepared by 
coacervation techniques or by interfacial polymerization; for example, 
hydroxymethylcellulose or gelatin-microcapsules and poly-(methylmethacrylate) 
microcapsules, respectively, in colloidal drug delivery systems (for example, liposomes, 
albumin microspheres, microemulsions, nano-particles, and nanocapsules) or in 
macroemulsions. 

The formulations to be used for in vivo administration are highly preferred to be 
sterile. This is readily accomplished by filtration through sterile filtration membranes or 
any of a number of techniques. 

Sustained-release preparations may also be prepared, such as semi-permeable 
matrices of solid hydrophobic polymers containing the antibody, which matrices are in 
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the form of shaped articles, e.g., films, or microcapsules. Examples of sustained-release 
matrices include polyesters, hydrogels (for example, poly(2-hydroxyethyl-methacrylate) ) 
or poly(vinylalcohol)), polylactides (Boswell and Scribner, U.S. Patent No. 3,773,919, 
1973), copolymers of L-glutamic acid and y ethyl-L-glutamate, non-degradable ethylene- 
vinyl acetate, degradable lactic acid-glycolic acid copolymers such as injectable 
microspheres composed of lactic acid-glycolic acid copolymer, and poly-D-(-)-3- 
hydroxybutyric acid. While polymers such as ethylene-vinyl acetate and lactic acid- 
glycolic acid enable release of molecules for over 100 days, certain hydrogels release 
proteins for shorter time periods and may be preferred. 

AAP recombinant expression vectors and host cells 

Vectors are tools used to shuttle DNA between host cells or as a means to express 
a nucleotide sequence. Some vectors function only in prokaryotes, while others function 
in both prokaryotes and eukaryotes, enabling large-scale DNA preparation from 
prokaryotes for expression in eukaryotes. Inserting the DNA of interest, such as an AAP 
nucleotide sequence or a fragment, is accomplished by ligation techniques and/or mating 
protocols well-known to the skilled artisan. Such DNA is inserted such that its 
integration does not disrupt any necessary components of the vector. In the case of 
vectors that are used to express the inserted DNA protein, the introduced DNA is 
operably-linked to the vector elements that govern its transcription and translation. 

Vectors can be divided into two general classes: Cloning vectors are replicating 
plasmid or phage with regions that are non-essential for propagation in an appropriate 
host cell, and into which foreign DNA can be inserted; the foreign DNA is replicated and 
propagated as if it were a component of the vector. An expression vector (such as a 
plasmid, yeast, or animal virus genome) is used to introduce foreign genetic material into 
a host cell or tissue in order to transcribe and translate the foreign DNA. In expression 
vectors, the introduced DNA is operably-linked to elements, such as promoters, that 
signal to the host cell to transcribe the inserted DNA. Some promoters are exceptionally 
useful, such as inducible promoters that control gene transcription in response to specific 
factors. Operably-linking an AAP or anti-sense construct to an inducible promoter can 
control the expression of an AAP or fragments, or anti-sense constructs. Examples of 
classic inducible promoters include those that are responsive to a-interferon, heat-shock, 
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heavy meta. ions, and steroids such as glucocorticoids (Kaufman, 1990) and tetracycline 
Other desirable inducible promoters include those that are not endogenous to the cells in 
which the construct is being introduced, but, however, is responsive in those cells when 
the induction agent is exogenously supplied. 

Vectors have many difference manifestations. A "plasmid" is a circular double 
stranded DNA molecule into which additional DNA segments can be introduced Viral 
vectors can accept additional DNA segments into the v,ral genome. Certain vectors are 
capable of autonomous replication in a host cell (e.g., bacterial vectors having a bacterial 
ong,n of replication and episomal mammalian vectors). Other vectors (e g 
non-episomal mammalian vectors) are integrated into the genome of a host cell upon 
introduction ,nto the host cel., and thereby are replicated along with the host genome In 
general, useful expression vectors are often plasmids. However, other forms of 
expression vectors, such as viral vectors (e.g., replication defective retroviruses 
adenoviruses and adeno-associated viruses) are contemplated. 

Recombinant expression vectors that comprise an AAP (or fragments) regulate an 
AAP transcription by exploiting one or more host cell-respons.ve (or that can be 
manipulated in vitro) regulatory sequences that is operably-linked to an AAP. "Operably- 
hnked" indicates that a nucleotide sequence of interest is linked to regulatory sequences 
such that expression of the nucleotide sequence is achieved. 

Vectors can be introduced in a variety of organisms and/or cells (Table D) 
Alternatively, the vectors can be transcribed and translated in vitro, for example using T7 
promoter regulatory sequences and T7 polymerase. 



Organism s 

Prokaryotes 



Table D Examples of hosts for cloning or expression 



Enterobacteri aceae 



E. coli 

K 12 strain MM294 
XI 776 
W3110 
K5 772 



Enterobacter 
Erwinia 
Klebsiella 
Proteus 



ATCC 31,446 
ATCC 31,537 
ATCC 27,325 
ATCC 53,635 
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Table D Examples of hosts for cloning or expression 



Organisms 



Examples 



Salmonella (S. tyhpimurium) 



Serratia (S. marcescans) 



Shigella 
Bacilli (B. subtilis and B. 



licheniformis) 



Pseudomonas (P. aeruginosa) 



Streptomyces 



Sources and References * 



Eukaryotes 



Yeasts 



Saccharomyces cerevisiae 



Schizosaccharomyces pombe 



Kluyveromyces 
K. lactis MW98-8C, 
CBS683, CBS4574 
K. fragilis 
K. bulgaricus 
K. wickeramii 
K. waltii 

K. drosophilarum 
K. thermotolerans 
K. marxianus; yarrowia 



(Fleer et al., 1991) 

(de Louvencourt et al, 1983) 

ATCC 12,424 
ATCC 16,045 
ATCC 24,178 
ATCC 56,500 
ATCC 36,906 



(EPO 402226, 1990^ 




Filamentous Fungi 



Invertebrate cells 



Vertebrate cells 



^Unreferenced cells 
(Manassas, VA). 



Tolypocladium 



Aspergillus (A nidulans and 
A. nigef) 



Drosophila S2 



Spodoptera Sf9 



Chinese Hamster Ovary 
(CHO) 



simian COS 
COS-7 



HEK 293 



(WO 91/00357, 199H 



(Kelly and Hynes, 1985; Tilburn 
et al, 1983; Yelton etal. 1984^ 



ATCCCRL 1651 



are generally available from American Type Culture Collection 
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Vector choice is dictated by the organism or cells being used and the desired fate 
of the vector. Vectors may replicate once in the target cells, or may be "suicide" vectors. 
In general, vectors comprise signal sequences, origins of replication, marker genes, 
enhancer elements, promoters, and transcription termination sequences. The choice of 
these elements depends on the organisms in which the vector will be used and are easily 
determined. Some of these elements may be conditional, such as an inducible or 
conditional promoter that is turned "on" when conditions are appropriate. Examples of 
inducible promoters include those that are tissue-specific, which relegate expression to 
certain cell types, steroid-responsive, or heat-shock reactive. Some bacterial repression 
systems, such as the lac operon, have been exploited in mammalian cells and transgenic 
animals (Fieck et aL, 1992; Wyborski et aL, 1996; Wyborski and Short, 1991). Vectors 
often use a selectable marker to facilitate identifying those cells that have incorporated 
the vector. Many selectable markers are well known in the art for the use with 
prokaryotes, usually antibiotic-resistance genes or the use of autotrophy and auxotrophy 
mutants. 

Using antisense and sense AAP oligonucleotides can prevent an AAP polypeptide 
expression. These oligonucleotides bind to target nucleic acid sequences, forming 
duplexes that block transcription or translation of the target sequence by enhancing 
degradation of the duplexes, terminating prematurely transcription or translation, or by 
other means. 

Antisense or sense oligonucleotides are singe-stranded nucleic acids, either RNA 
or DNA, which can bind a target AAP mRNA (sense) or an AAP DNA (antisense) 
sequences. According to the present invention, antisense or sense oligonucleotides 
comprise a fragment of an AAP DNA coding region of at least about 14 nucleotides, 
preferably from about 14 to 30 nucleotides. In general, antisense RNA or DNA 
molecules can comprise at least 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 
80, 85, 90, 95, 100 bases in length or more. Among others, (Stein and Cohen, 1988; van 
der Krol et aL, 1988a) describe methods to derive antisense or a sense oligonucleotides 
from a given cDNA sequence. 

Modifications of antisense and sense oligonucleotides can augment their 
effectiveness. Modified sugar-phosphodiester bonds or other sugar linkages (WO 
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91/06629, 1991), increase in vivo stability by conferring resistance to endogenous 
nucleases without disrupting binding specificity to target sequences. Other modifications 
can increase the affinities of the oligonucleotides for their targets, such as covalently 
linked organic moieties (WO 90/10448, 1990) or poly-(L)-lysine. Other attachments 
modify binding specificities of the oligonucleotides for their targets, including metal 
complexes or intercalating (e.g. ellipticine) and alkylating agents. 

To introduce antisense or sense oligonucleotides into target cells (cells containing 
the target nucleic acid sequence), any gene transfer method may be used and are well 
known to those of skill in the art. Examples of gene transfer methods include 1) 
biological, such as gene transfer vectors like Epstein-Barr virus or conjugating the 
exogenous DNA to a ligand-binding molecule (WO 91/04753, 1991), 2) physical, such as 
electroporation, and 3) chemical, such as CaP0 4 precipitation and oligonucleotide-lipid 
complexes (WO 90/10448, 1990). 

The terms "host cell" and "recombinant host cell" are used interchangeably. Such 
terms refer not only to a particular subject cell but also to the progeny or potential 
progeny of such a cell. Because certain modifications may occur in succeeding 
generations due to either mutation or environmental influences, such progeny may not, in 
fact, be identical to the parent cell, but are still included within the scope of the term. 

Methods of eukaryotic cell transfection and prokaryotic cell transformation are 
well known in the art. The choice of host cell will dictate the preferred technique for 
introducing the nucleic acid of interest. Table E, which is not meant to be limiting, 
summarizes many of the known techniques in the art. Introduction of nucleic acids into 
an organism may also be done with ex vivo techniques that use an in vitro method of 
transfection, as well as established genetic techniques, if any, for that particular organism. 



Table E Methods to introduce nucleic acid into cells 



Cells 


Methods 


References 


Notes 


Prokaryotes 
(bacteria) 


Calcium chloride 


(Cohen et al., 1972; 
Hanahan, 1983; Mandel and 
Higa, 1970) 




Electroporation 


(Shigekawa and Dower, 
1988) 




Eukaryotes 
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Table E Methods to introduce nucleic acid into cells 



Cells 


Methods 


References 


Notes 


Mammalian 
cells 


Calcium phosphate 
transfection 


N-(2- 

Hydroxyethyl)piperazine-AT - 
(2-ethanesulfonic acid 
(HEPES) buffered saline 
solution (Chen and 
Okayama, 1988; Graham and 
van der Eb, 1973; Wigler et 
ai, 1978) 

BES (AW-bis(2- 
hydroxyethyl)-2- 
aminoethanesulfonic acid) 
buffered solution (Ishiura et 
a/., 1982) 


Cells may be 

shocked with 
glycerol or 
dimethylsulfoxide 
(DMSO) to increase 
transfection 
efficiency (Ausubel 
et al., ivo /). 


Diethylaminoethyl 

(DEAE)-Dextran 

transfection 


(Fujita et aL, 1986; Lopata et 
a/., 1984; Selden et a/., 
1986) 


Most useful for 
transient, but not 
stable, transfections. 
Chloroquine can be 
used to increase 
efficiency. 


Electroporation 


(Neumann et a/., 1982; 
Potter, 1988; Potter et al. y 
1984; Wong and Neumann, 
1982) 


Especially useful for 

hard-to-transfect 

lymphocytes. 


Cationic lipid 

reagent 

transfection 


(Elroy-Stein and Moss, 1990; 
Feigner et al, 1987; Rose et 
aL, 1991; Whitt etal, 1990) 


Applicable to both 
in vivo and in vitro 
transfection. 


Retroviral 


Production exemplified by 
(Cepko et ai, 1984; Miller 
ana ouiiimore, 1700, r^ear ei 
ai, 1993) 

Infection in vitro and in vivo: 
(Austin and Cepko, 1990; 
Bodine et a/., 1991; Fekete 

FinH f^pnlco 1 QQ'V T /^TniQphk'ji 

UllU V^^L/IVUj 177 J j 1 *'l 1 llOV-slJIvd 

et al. 7 1986; Turner et ai, 
1990; Williams et al, 1984) 


Lengthy process, 
many packaging 
lines available at 
ATCC. Applicable 
to both in vivo and 

in \jitm tr?*TiQ J ff*r*tir*n 

III VllliJ 11 ClllOlC'l/liUll. 


Polybrene 


(Chaney et ah, 1986; Kawai 
and Nishizawa, 1984) 




Microinjection 


(Capecchi, 1980) 


Can be used to 
establish cell lines 
carrying integrated 
copies of AAP DNA 
sequences. 
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Table E Methods to introduce nucleic acid into cells 



Cells 


ivieinutio 


T? fV* vf* nrpc 






r^roiopjasi lusion 


(Rassoulzadegan et al, 1982; 
ijanun-vjoiuiii c/i., i"oi, 
Schaffner, 1980) 




Insect cells 
(in vitro) 


Baculovirus 
systems 


(Luckow, 1991; Miller, 
1988; O'Reilly ef al., 1992) 


T Tca"Fii1 €r\r in ijjfr/i 

production of 
proteins with 
eukaryotic 

m r*H i fi c s\ ti on q 


Yeast 


Electroporation 


(Becker and Guarente, 1991) 




Lithium acetate 


(Gietz et al, 1998; Ito et al, 

lyo3 ) 




Spheroplast fusion 


(Beggs, 1978; Hinnen et al, 
1978) 


Laborious, can 
produce aneuploids. 


Plant cells 
(general 
reference: 
(Hansen and 
Wright, 
1999)) 


Agrobacterium 
transformation 


(Bechtold and Pelletier, 
1998; Escudero and Hohn, 
1997; Hansen and Chilton, 
lyyy; louraev andal., 




Biolistics 
(microprojectiles) 


(Finer et al, 1999; Hansen 

and Chilton, 1999; Shillito, 
i ooo\ 




Electroporation 
(protoplasts) 


(Fromm et al, 1985; Ou-Lee 
et al, 1986; Rhodes et al, 
iy<So, oaunoers et at., ivoyj 
May be combined with 
liposomes ^incK dnu ai., 




i uiyciiiyiciic 
glycol (PEG) 
treatment 


(Shillito, 1999) 




Liposomes 


May be combined with 
electroporation (Trick and 
al., 1997) 




in planta 
microinjection 


(Leduc and al., 1996; Zhou 
and al., 1983) 




Seed imbibition 


(Trick and al., 1997) 




Laser beam 


(Hoffman, 1996) 




Silicon carbide 
whiskers 


(Thompson and al., 1995) 





Vectors often use a selectable marker to facilitate identifying those cells that have 
incorporated the vector. Many selectable markers are well known in the art for the use 
with prokaryotes, usually antibiotic-resistance genes or the use of autotrophy and 
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auxotrophy mutants. Table F lists often-used selectable markers for mammalian cell 
transfection. 



Table F Useful selectable markers for eukaryote cell transfection 



Selectable Marker 


Selection 


Action 


Reference 


Adenosine deaminase 
(ADA) 


Media includes 9-p-D- 
xylofuranosyl adenine 
(Xyl-A) 


Conversion of Xyl-A to 
Xyl-ATP, which 
incorporates into 
nucleic acids, killing 
cells. ADA detoxifies 


(Kaufman et 
aL, 1986) 


Dihydrofolate 
reductase (DHFR) 


Methotrexate (MTX) 
and dialyzed serum 
(purine-free media) 


MTX competitive 
inhibitor of DHFR. In 
absence of exogenous 
purines, cells require 
DHFR, a necessary 
enzyme in purine 
biosynthesis. 


(Simon sen 
and 

Levinson, 
1983) 


Aminoglycoside 
phosphotransferase 
("APH", "neo", 
"G418") 


G418 


G418, an 
aminoglycoside 
detoxified by APH, 
interferes with 
ribosomal function and 
consequently, 
translation. 


(Southern 
and Berg, 
1982) 


Hygromycin-B- 

phosphotransferase 

(HPH) 


hygromycin-B 


Hygromycin-B, an 
aminocyclitol 
detoxified by HPH, 
disrupts protein 
translocation and 
promotes 
mistranslation. 


(Palmer et 
a/., 1987) 


Thymidine kinase 
(TK) 


Forward selection 

(TK+): Media (HAT) 

incorporates 

aminopterin. 

Reverse selection (TK- 

): Media incorporates 

5-bromodeoxyuridine 

(BrdU). 


Forward: Aminopterin 
forces cells to synthesze 
dTTP from thymidine, a 
pathway requiring TK. 
Reverse: TK 
phosphorylates BrdU, 
which incorporates into 
nucleic acids, killing 
cells. 


(Littlefield, 
1964) 
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A host cell of the invention, such as a prokaryotic or eukaryotic host cell in 
culture, can be used to produce an AAR Accordingly, the invention provides methods 
for producing an AAP using the host cells of the invention. In one embodiment, the 
method comprises culturing the host cell of the invention (into which a recombinant 
expression vector encoding an AAP has been introduced) in a suitable medium, such that 
an AAP is produced In another embodiment, the method further comprises isolating an 
AAP from the medium or the host cell. 

Transgenic AAP animals 

Transgenic animals are useful for studying the function and/or activity of an AAP 
and for identifying and/or evaluating modulators of AAP activity. 'Transgenic animals" 
are non-human animals, preferably mammals, more preferably a rodents such as rats or 
mice, in which one or more of the cells include a transgene. Other transgenic animals 
include primates, sheep, dogs, cows, goats, chickens, amphibians, etc. A "transgene" is 
exogenous DNA that is integrated into the genome of a cell from which a transgenic 
animal develops, and that remains in the genome of the mature animal. Transgenes 
preferably direct the expression of an encoded gene product in one or more cell types or 
tissues of the transgenic animal with the purpose of preventing expression of a naturally 
encoded gene product in one or more cell types or tissues (a "knockout" transgenic 
animal), or serving as a marker or indicator of an integration, chromosomal location, or 
region of recombination {e.g. cre/loxP mice). A "homologous recombinant animal" is a 
non-human animal, such as a rodent, in which an endogenous AAP has been altered by an 
exogenous DNA molecule that recombines homologously with an endogenous AAP in a 
(e.g. embryonic) cell prior to development the animal. Host cells with an exogenous AAP 
can be used to produce non-human transgenic animals, such as fertilized oocytes or 
embryonic stem cells into which an AAF-coding sequences have been introduced. Such 
host cells can then be used to create non-human transgenic animals or homologous 
recombinant animals. 

1. Approaches to transgenic animal production 

A transgenic animal can be created by introducing an AAP into the male pronuclei 
of a fertilized oocyte (e.g., by microinjection, retroviral infection) and allowing the oocyte 
to develop in a pseudopregnant female foster animal (pffa). An AAP cDNA sequences 
(SEQ ID NO:l, 3, 5, 7, 9, 1 1, 13 or 15) can be introduced as a transgene into the genome 
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of a non-human animal. Alternatively, a homologue of an AAP, such as the naturally- 
occuring variant of an AAP, can be used as a transgene. Intronic sequences and 
polyadenylation signals can also be included in the transgene to increase transgene 
expression. Tissue-specific regulatory sequences can be operably-linked to the AAP 
transgene to direct expression of the AAP to particular cells. Methods for generating 
transgenic animals via embryo manipulation and microinjection, particularly animals such 
as mice, have become conventional in the art, e.g. (Evans et al., U.S. Patent No. 
4,870,009, 1989; Hogan, 0879693843, 1994; Leder and Stewart, U.S. Patent No. 
4,736,866, 1988; Wagner and Hoppe, US Patent No. 4,873,191, 1989). Other non-mice 
transgenic animals may be made by similar methods. A transgenic founder animal, which 
can be used to breed additional transgenic animals, can be identified based upon the 
presence of the transgene in its genome and/or expression of the transgene mRNA in 
tissues or cells of the animals. Transgenic animals can be bred to other transgenic 
animals carrying other transgenes. 

2. Vectors for transgenic animal production 

To create a homologous recombinant animal, a vector containing at least a portion 
of an AAP into which a deletion, addition or substitution has been introduced to thereby 
alter, e.g., disrupt or alter the expression of, an AAP. An AAP can be a murine gene, or 
other AAP homologue, such as a naturally occurring variant. In one approach, a knockout 
vector functionally disrupts an endogenous AAP gene upon homologous recombination, 
and thus a non-functional AAP protein, if any, is expressed. 

Alternatively, the vector can be designed such that, upon homologous 
recombination, an endogenous AAP is mutated or otherwise altered but still encodes 
functional protein (e.g., the upstream regulatory regfon can be altered to thereby alter the 
expression of an endogenous AAP). In this type of homologous recombination vector, 
the altered portion of the AAP is flanked at its 5 - and 3'-termini by additional nucleic acid 
of the AAP to allow for homologous recombination to occur between the exogenous AAP 
carried by the vector and an endogenous AAP in an embryonic stem cell. The additional 
flanking AAP nucleic acid is sufficient to engender homologous recombination with the 
endogenous AAP. Typically, several kilobases of flanking DNA (both at the 5 - and 3'- 
termini) are included in the vector (Thomas and Capecchi, 1987). The vector is then 
introduced into an embryonic stem cell line (e.g., by electroporation), and cells in which 
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the introduced AAP has homologously-recombined with an endogenous AAP are selected 
(Li etai, 1992). 

3. Introduction of an AAP transgene cells during development 
Selected cells are then injected into a blastocyst of an animal (e.g., a mouse) to 
form aggregation chimeras (Bradley, 1987). A chimeric embryo can then be implanted 
into a suitable pffa and the embryo brought to term. Progeny harboring the 
homologously-recombined DNA in their germ cells can be used to breed animals in 
which all cells of the animal contain the homologously-recombined DNA by germline 
transmission of the transgene. Methods for constructing homologous recombination 
vectors and homologous recombinant animals are described (Berns et al, WO 93/04169, 
1993; Bradley, 1991; Kucherlapati et aL, WO 91/01 140, 1991; Le Mouellic and Brullet, 
WO 90/11354, 1990). 

Alternatively, transgenic animals that contain selected systems that allow for 
regulated expression of the transgene can be produced. An example of such a system is 
the cre/loxP recombinase system of bacteriophage PI (Lakso et al., 1992). Another 
recombinase system is the FLP recombinase system of Saccharomyces cerevisiae 
(O'Gorman et al., 1991). If a cre/loxP recombinase system is used to regulate expression 
of the transgene, animals containing transgenes encoding both the Cre recombinase and a 
selected protein are required. Such animals can be produced as "double" transgenic 
animals, by mating an animal containing a transgene encoding a selected protein to 
another containing a transgene encoding a recombinase. 

Clones of transgenic animals can also be produced (Wilmut et aL, 1997). In brief, 
a cell from a transgenic animal can be isolated and induced to exit the growth cycle and 
enter G 0 phase. The quiescent cell can then be fused to an enucleated oocyte from an 
animal of the same species from which the quiescent cell is isolated. The reconstructed 
oocyte is then cultured to develop to a morula or blastocyte and then transferred to a pffa. 
The offspring borne of this female foster animal will be a clone of the "parent" transgenic 
animal. 

Pharmaceutical compositions 

The AAP nucleic acid molecules, AAP polypeptides, and anti-AAP Abs (active 
compounds) of the invention, and derivatives, fragments, analogs and homologs thereof, 
can be incorporated into pharmaceutical compositions. Such compositions typically 
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comprise the nucleic acid molecule, protein, or antibody and a pharmaceutically 
acceptable carrier. A "pharmaceutically acceptable carrier" includes any and all solvents, 
dispersion media, coatings, antibacterial and antifungal agents, isotonic and absorption 
delaying agents, and the like, compatible with pharmaceutical administration (Gennaro, 
2000). Preferred examples of such carriers or diluents include, but are not limited to, 
water, saline, finger's solutions, dextrose solution, and 5% human serum albumin. 
Liposomes and non-aqueous vehicles such as fixed oils may also be used Except when a 
conventional media or agent is incompatible with an active compound, use of these 
compositions is contemplated. Supplementary active compounds can also be 
incorporated into the compositions. 

1 . General considerations 

A pharmaceutical composition of the invention is formulated to be compatible 
with its intended route of administration, including intravenous, intradermal, 
subcutaneous, oral {e.g., inhalation), transdermal (i.e., topical), transmucosal, and rectal 
administration. Solutions or suspensions used for parenteral, intradermal, or 
subcutaneous application can include: a sterile diluent such as water for injection, saline 
solution, fixed oils, polyethylene glycols, glycerine, propylene glycol or other synthetic 
solvents; antibacterial agents such as benzyl alcohol or methyl parabens; antioxidants 
such as ascorbic acid or sodium bisulfite; chelating agents such as 

ethylenediaminetetraacetic acid (EDTA); buffers such as acetates, citrates or phosphates, 
and agents for the adjustment of tonicity such as sodium chloride or dextrose. The pH 
can be adjusted with acids or bases, such as hydrochloric acid or sodium hydroxide. The 
parenteral preparation can be enclosed in ampoules, disposable syringes or multiple dose 
vials made of glass or plastic. 

2. Injectable formulations 

Pharmaceutical compositions suitable for injection include sterile aqueous 
solutions (where water soluble) or dispersions and sterile powders for the 
extemporaneous preparation of sterile injectable solutions or dispersion. For intravenous 
administration, suitable carriers include physiological saline, bacteriostatic water, 
CREMOPHOR EL™ (BASF, Parsippany, N.J.) or phosphate buffered saline (PBS). In all 
cases, the composition must be sterile and should be fluid so as to be administered using a 
syringe. Such compositions should be stable during manufacture and storage and must be 
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preserved against contamination from microorganisms such as bacteria and fungi. The 
carrier can be a solvent or dispersion medium containing, for example, water, ethanol, 
polyol (such as glycerol, propylene glycol, and liquid polyethylene glycol), and suitable 
mixtures. Proper fluidity can be maintained, for example, by using a coating such as 
lecithin, by maintaining the required particle size in the case of dispersion and by using 
surfactants. Various antibacterial and antifungal agents, for example, parabens, 
chlorobutanol, phenol, ascorbic acid, and thimerosal, can contain microorganism 
contamination. Isotonic agents, for example, sugars, polyalcohols such as manitol, 
sorbitol, and sodium chloride can be included in the composition. Compositions that can 
delay absorption include agents such as aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by incorporating the active compound 
(e.g., an AAP or anti-AAP antibody) in the required amount in an appropriate solvent 
with one or a combination of ingredients as required, followed by sterilization. 
Generally, dispersions are prepared by incorporating the active compound into a sterile 
vehicle that contains a basic dispersion medium, and the other required ingredients as 
discussed. Sterile powders for the preparation of sterile injectable solutions, methods of 
preparation include vacuum drying and freeze-drying that yield a powder containing the 
active ingredient and any desired ingredient from a sterile solutions. 

3. Oral compositions 

Oral compositions generally include an inert diluent or an edible carrier. They can 
be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral 
therapeutic administration, the active compound can be incorporated with excipients and 
used in the form of tablets, troches, or capsules. Oral compositions can also be prepared 
using a fluid carrier for use as a mouthwash, wherein the compound in the fluid carrier is 
applied orally. Pharmaceutical ly compatible binding agents, and/or adjuvant materials 
can be included. Tablets, pills, capsules, troches and the like can contain any of the 
following ingredients, or compounds of a similar nature: a binder such as microcrystalline 
cellulose, gum tragacanth or gelatin; an excipient such as starch or lactose, a 
disintegrating agent such as alginic acid, PRIMOGEL, or corn starch; a lubricant such as 
magnesium stearate or STEROTES; a glidant such as colloidal silicon dioxide; a 
sweetening agent such as sucrose or saccharin; or a flavoring agent such as peppermint, 
methyl salicylate, or orange flavoring. 
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4. Compositions for inhalation 

For administration by inhalation, the compounds are delivered as an aerosol spray 
from a nebulizer or a pressurized container that contains a suitable propellant, e.g., a gas 
such as carbon dioxide. 

5. Systemic administration 

Systemic administration can also be transmucosal or transdermal. For 
transmucosal or transdermal administration, penetrants that can permeate the target 
barrier(s) are selected. Transmucosal penetrants include, detergents, bile salts, and 
fusidic acid derivatives. Nasal sprays or suppositories can be used for transmucosal 
administration. For transdermal administration, the active compounds are formulated into 
ointments, salves, gels, or creams. 

The compounds can also be prepared in the form of suppositories {e.g., with bases 
such as cocoa butter and other glycerides) or retention enemas for rectal delivery. 

6. Carriers 

In one embodiment, the active compounds are prepared with carriers that protect 
the compound against rapid elimination from the body, such as a controlled release 
formulation, including implants and microencapsulated delivery systems. Biodegradable, 
biocompatible polymers can be used, such as ethylene vinyl acetate, polyanhydrides, 
polyglycolic acid, collagen, polyorthoesters, and polylactic acid. Such materials can be 
obtained commercially from ALZA Corporation (Mountain View, CA) and NOVA 
Pharmaceuticals, Inc. (Lake Elsinore, CA), or prepared by one of skill in the art. 
Liposomal suspensions can also be used as pharmaceutical ly acceptable carriers. These 
can be prepared according to methods known to those skilled in the art, such as in 
(Eppstein et aL, US Patent No. 4,522,81 1, 1985). 

7. Unit dosage 

Oral formulations or parenteral compositions in unit dosage form can be created to 
facilitate administration and dosage uniformity. Unit dosage form refers to physically 
discrete units suited as single dosages for the subject to be treated, containing a 
therapeutically effective quantity of active compound in association with the required 
pharmaceutical carrier. The specification for the unit dosage forms of the invention are 
dictated by, and directly dependent on, the unique characteristics of the active compound 
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and the particular desired therapeutic effect, and the inherent limitations of compounding 
the active compound 

8. Gene therapy compositions 

The nucleic acid molecules of the invention can be inserted into vectors and used 
as gene therapy vectors. Gene therapy vectors can be delivered to a subject by, for 
example, intravenous injection, local administration (Nabel and Nabel, US Patent No. 
5,328,470, 1994), or by stereotactic injection (Chen et a/., 1994). The pharmaceutical * 
preparation of a gene therapy vector can include an acceptable diluent, or can comprise a 
slow release matrix in which the gene delivery vehicle is imbedded. Alternatively, where 
the complete gene delivery vector can be produced intact from recombinant cells, e.g., 
retroviral vectors, the pharmaceutical preparation can include one or more cells that 
produce the gene delivery system. 

9. Dosage 

The pharmaceutical composition and method of the present invention may further 
comprise other therapeutically active compounds as noted herein that are usually applied 
in the treatment of the above mentioned pathological conditions. 

In the treatment or prevention of conditions which require AAP modulation an 
appropriate dosage level will generally be about 0.01 to 500 mg per kg patient body 
weight per day which can be administered in single or multiple doses. Preferably, the 
dosage level will be about 0.1 to about 250 mg/kg per day; more preferably about 0.5 to 
about 100 mg/kg per day. A suitable dosage level may be about 0.01 to 250 mg/kg per 
day, about 0.05 to 100 mg/kg per day, or about 0.1 to 50 mg/kg per day. Within this 
range the dosage may be 0.05 to 0.5, 0.5 to 5 or 5 to 50 mg/kg per day. For oral 
administration, the compositions are preferably provided in the form of tablets containing 
1.0 to 1000 milligrams of the active ingredient, particularly 1.0, 5.0, 10.0, 15.0. 20.0, 
25.0, 50.0, 75.0, 100.0, 150.0, 200.0, 250.0, 300.0, 400.0, 500.0, 600.0, 750.0, 800.0, 
900.0, and 1000.0 milligrams of the active ingredient for the symptomatic adjustment of 
the dosage to the patient to be treated. The compounds may be administered on a 
regimen of 1 to 4 times per day, preferably once or twice per day. 

It will be understood, however, that the specific dose level and frequency of 
dosage for any particular patient may be varied and will depend upon a variety of factors 
including the activity of the specific compound employed, the metabolic stability and 
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length of action of that compound, the age, body weight, general health, sex, diet, mode 
and time of administration, rate of excretion, drug combination, the severity of the 
particular condition, and the host undergoing therapy. 
10. Kits for pharmaceutical compositions 

The pharmaceutical compositions can be included in a kit, container, pack, or 
dispenser together with instructions for administration. When the invention is supplied as 
a kit, the different components of the composition may be packaged in separate containers 
and admixed immediately before use. Such packaging of the components separately may 
permit long-term storage without losing the active components' functions. 

Kits may also include reagents in separate containers that facilitate the execution 
of a specific test, such as diagnostic tests or tissue typing. For example, AAP DNA 
templates and suitable primers may be supplied for internal controls. 

(a) Containers or vessels 

The reagents included in the kits can be supplied in containers of any sort such 
that the life of the different components are preserved, and are not adsorbed or altered by 
the materials of the container. For example, sealed glass ampules may contain 
lyophilized luciferase or buffer that have been packaged under a neutral, non-reacting gas, 
such as nitrogen. Ampoules may consist of any suitable material, such as glass, organic 
polymers, such as polycarbonate, polystyrene, etc., ceramic, metal or any other material 
typically employed to hold reagents. Other examples of suitable containers include 
simple bottles that may be fabricated from similar substances as ampules, and envelopes, 
that may consist of foil-lined interiors, such as aluminum or an alloy. Other containers 
include test tubes, vials, flasks, bottles, syringes, or the like. Containers may have a 
sterile access port, such as a bottle having.a stopper that can be pierced by a hypodermic 
injection needle. Other containers may have two compartments that are separated by a 
readily removable membrane that upon removal permits the components to mix. 
Removable membranes may be glass, plastic, rubber, etc. 

(b) Instructional materials 

Kits may also be supplied with instructional materials. Instructions may be 
printed on paper or other substrate, and/or may be supplied as an electronic-readable 
medium, such as a floppy disc, CD-ROM, DVD-ROM, Zip disc, videotape, audiotape, 
etc. Detailed instructions may not be physically associated with the kit; instead, a user 
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may be directed to an internet web site specified by the manufacturer or distributor of the 
kit, or supplied as electronic mail. 

Screening and detection methods 

The isolated nucleic acid molecules of the invention can be used to express an 
AAP {e.g., via a recombinant expression vector in a host cell in gene therapy 
applications), to detect an AAP mRNA {e.g., in a biological sample) or a genetic lesion in 
an AAP, and to modulate AAP activity, as described below. In addition, AAP 
polypeptides can be used to screen drugs or compounds that modulate the AAP activity or 
expression as well as to treat disorders characterized by insufficient or excessive 
production of an AAP or production of AAP forms that have decreased or aberrant 
activity compared to an AAP wild-type protein, or modulate biological function that 
involve an AAP. In addition, the anti-AAP Abs of the invention can be used to detect and 
isolate an AAP and modulate AAP activity. 

1 . Screening assays 

The invention provides a method (screening assay) for identifying modalities, i.e., 
candidate or test compounds or agents {e.g., peptides, peptidomimetics, small molecules 
or other drugs), foods, combinations thereof, etc., that effect an AAP, a stimulatory or 
inhibitory effect, inlcuding translation, transcription, activity or copies of the gene in 
cells. The invention also includes compounds identified in screening assays. 

Testing for compounds that increase or decrease AAP activity are desirable. A 
compound may modulate an AAP activity by affecting: (1) the number of copies of the 
gene in the cell (amplifiers and deamplifiers); (2) increasing or decreasing transcription of 
an AAP (transcription up-regulators and down-regulators); (3) by increasing or decreasing 
the translation of an AAP mRNA into protein (translation up-regulators and down- 
regulators); or (4) by increasing or decreasing the activity of an AAP itself (agonists and 
antagonists). 

(a) effects of compounds 

To identify compounds that affect an AAP at the DNA, RNA and protein levels, 
cells or organisms are contacted with a candidate compound and the corresponding 
change in an AAP DNA, RNA or protein is assessed (Ausubel et al, 1987). For DNA 
amplifiers and deamplifiers, the amount of an AAP DNA is measured, for those 
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compounds that are transcription up-regulators and down -regulators the amount of an 
AAP mRNA is determined; for translational up- and down-regulators, the amount of an 
AAP polypeptide is measured. Compounds that are agonists or antagonists may be 
identified by contacting cells or organisms with the compound, and then examining, for 
example, the model of angiogenesis in vitro. 

In one embodiment, many assays for screening candidate or test compounds that 
bind to or modulate the activity of an AAP or polypeptide or biologically-active portion 
are available. Ttest compounds can be obtained using any of the numerous approaches in 
combinatorial library methods, including: biological libraries; spatially addressable 
parallel solid phase or solution phase libraries; synthetic library methods requiring 
deconvolution; the "one-bead one-compound" library method; and synthetic library 
methods using affinity chromatography selection. The biological library approach is 
limited to peptides, while the other four approaches encompass peptide, non-peptide 
oligomer or small molecule libraries of compounds (Lam, 1997). 

(b) small molecules 

A "small molecule" refers to a composition that has a molecular weight of less 
than about 5 kD and more preferably less than about 4 kD, most preferably less than 0.6 
kD. Small molecules can be, nucleic acids, peptides, polypeptides, peptidomimetics, 
carbohydrates, lipids or other organic or inorganic molecules. Libraries of chemical 
and/or biological mixtures, such as fungal, bacterial, or algal extracts, are known in the art 
and can be screened with any of the assays of the invention. Examples of methods for the 
synthesis of molecular libraries can be found in: (Carell et a/., 1994a; Carell et ai, 1994b; 
Cho et a/., 1993; DeWitt et aL, 1993; Gallop et aL, 1994; Zuckermann et ai, 1994). 

Libraries of compounds may be presented in solution (Houghten et al y 1992) or 
on beads (Lam et ai, 1991), on chips (Fodor et a/., 1993), bacteria, spores (Ladner et ai, 
US Patent No. 5,223,409, 1993), plasmids (Cull et al. 9 1992) or on phage (Cwirla et ai, 
1990; Devlin et ai, 1990; Felici et al., 1991; Ladner et ai, US Patent No. 5,223,409, 
1993; Scott and Smith, 1990). A cell-free assay comprises contacting an AAP or 
biologically-active fragment with a known compound that binds the AAP to form an 
assay mixture, contacting the assay mixture with a test compound, and determining the 
ability of the test compound to interact with the AAP, where determining the ability of the 
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test compound to interact with the AAP comprises determining the ability of the AAP to 
preferentially bind to or modulate the activity of an AAP target molecule. 

(c) cell-free assays 

The cell-free assays of the invention may be used with both soluble or a 
membrane-bound forms of an AAP. In the case of cell-free assays comprising the 
membrane-bound form, a solubilizing agent to maintain the AAP in solution. Examples 
of such solubilizing agents include non-ionic detergents such as n-octylglucoside, n- 
dodecylglucoside, n-dodecylmaltoside, octanoyl-N-methylglucamide, decanoyl-N- 
methylglucamide, TRITON® X-100 and others from the TRITON® series, THESIT®, 
Isotridecypoly(ethylene glycol ether) n , N-dodecyl— N,N-dimethyl-3-ammonio-l -propane 
sulfonate, 3-(3-cholamidopropyl) dimethylamminiol-l-propane sulfonate (CHAPS), or 3- 
(3-cholamidopropyl)dimethylamminiol-2-hydroxy-l -propane sulfonate (CHAPSO). 

(d) immobilization of target molecules to facilitate screening 

In more than one embodiment of the assay methods, immobilizing either an AAP 
or its partner molecules can facilitate separation of complexed from uncomplexed forms 
of one or both of the proteins, as well as to accommodate high throughput assays. 
Binding of a test compound to an AAP, or interaction of an AAP with a target molecule 
in the presence and absence of a candidate compound, can be accomplished in any vessel 
suitable for containing the reactants, such as microtiter plates, test tubes, and 
micro-centrifuge tubes. A fusion protein can be provided that adds a domain that allows 
one or both of the proteins to be bound to a matrix. For example, GST-AAP fusion 
proteins or GST-target fusion proteins can be adsorbed onto glutathione sepharose beads 
(SIGMA Chemical, St. Louis, MO) or glutathione derivatized microtiter plates that are 
then combined with the test compound or the test compound and either the non-adsorbed 
target protein or an AAP, and the mixture is incubated under conditions conducive to 
complex formation {e.g., at physiological conditions for salt and pH). Following 
incubation, the beads or microtiter plate wells are washed to remove any unbound 
components, the matrix immobilized in the case of beads, complex determined either 
directly or indirectly, for example, as described. Alternatively, the complexes can be 
dissociated from the matrix, and the level of AAP binding or activity determined using 
standard techniques. 
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Other techniques for immobilizing proteins on matrices can also be used in 
screening assays. Either an AAP or its target molecule can be immobilized using biotin- 
avidin or biotin-streptavidin systems. Biotinylation can be accomplished using many 
reagents, such as biotin-NHS (N-hydroxy-succinimide; PIERCE Chemicals, Rockford, 
IL), and immobilized in wells of streptavidin-coated 96 well plates (PIERCE Chemical). 
Alternatively, Abs reactive with an AAP or target molecules, but which do not interfere 
with binding of the AAP to its target molecule, can be derivatized to the wells of the 
plate, and unbound target or an AAP trapped in the wells by antibody conjugation. 
Methods for detecting such complexes, in addition to those described for the 
GST-immobilized complexes, include immunodetection of complexes using Abs reactive 
with an AAP or its target, as well as enzyme-linked assays that rely on detecting an 
enzymatic activity associated with the AAP or target molecule. 

(e) screens to identify modulators 

Modulators of AAP expression can be identified in a method where a cell is 
contacted with a candidate compound and the expression of an AAP mRNA or protein in 
the cell is determined. The expression level of the AAP mRNA or protein in the presence 
of the candidate compound is compared to the AAP mRNA or protein levels in the 
absence of the candidate compound. The candidate compound can then be identified as a 
modulator of the AAP mRNA or protein expression based upon this comparison. For 
example, when expression of an AAP mRNA or protein is greater {i.e., statistically 
significant) in the presence of the candidate compound than in its absence, the candidate 
compound is identified as a stimulator of the AAP mRNA or protein expression. 
Alternatively, when expression of the AAP mRNA or protein is less (statistically 
significant) in the presence of the candidate compound than in its absence, the candidate 
compound is identified as an inhibitor of the AAP mRNA or protein expression. The 
level of an AAP mRNA or protein expression in the cells can be determined by methods 
described for detecting an AAP mRNA or protein, 
(i) hybrid assays 

In yet another aspect of the invention, an AAP can be used as "bait" in two-hybrid 
or three hybrid assays (Bartel et ai, 1993; Brent et ai, WO94/10300, 1994; Iwabuchi et 
al. 9 1993; Madura et al., 1993; Saifer et ai, US Patent No. 5,283,317, 1994; Zervos et aL 9 
1993) to identify other proteins that bind or interact with the AAP and modulate AAP 
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activity. Such AAP-bps are also likely to be involved in the propagation of signals by the 
AAP as, for example, upstream or downstream elements of an AAP pathway. 

The two-hybrid system is based on the modular nature of most transcription 
factors, which consist of separable DNA-binding and activation domains. Briefly, the 
assay utilizes two different DNA constructs. In one construct, the gene that codes for an 
AAP is fused to a gene encoding the DNA binding domain of a known transcription 
factor (e.g., GALA). The other construct, a DNA sequence from a library of DNA 
sequences that encodes an unidentified protein ("prey" or "sample") is fused to a gene that 
codes for the activation domain of the known transcription factor. If the "bait" and the 
"prey" proteins are able to interact in vivo, forming an AAP-dependent complex, the 
DNA-binding and activation domains of the transcription factor are brought into close 
proximity. This proximity allows transcription of a reporter gene (e.g., LacZ) that is 
operably-linked to a transcriptional regulatory site responsive to the transcription factor. 
Expression of the reporter gene can be detected, and cell colonies containing the 
functional transcription factor can be isolated and used to obtain the cloned gene that 
encodes the AAP-interacting protein. 

The invention further pertains to novel agents identified by the aforementioned 
screening assays and uses thereof for treatments as described herein. 
2. Detection assays 

Portions or fragments of an AAP cDNA sequences identified herein (and the 
complete AAP gene sequences) are useful in themselves. By way of non-limiting 
example, these sequences can be used to: (1) identify an individual from a minute 
biological sample (tissue typing); and (2) aid in forensic identification of a biological 
sample. 

(a) Tissue typing 

The AAP sequences of the invention can be used to identify individuals from 
minute biological samples. In this technique, an individual's genomic DNA is digested 
with one or more restriction enzymes and probed on a Southern blot to yield unique 
bands. The sequences of the invention are useful as additional DNA markers for 
"restriction fragment length polymorphisms" (RFLP; (Smulson et al, US Patent No. 
5,272,057, 1993)). 
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Furthennore, the AAP sequences can be used to determine the actual base-by-base 
DNA sequence of targeted portions of an individual's genome. AAP sequences can be 
used to prepare two PCR primers from the 5'- and 3-termini of the sequences that can 
then be used to amplify an the corresponding sequences from an individual's genome and 
then sequence the amplified fragment. 

Panels of corresponding DNA sequences from individuals can provide unique 
individual identifications, as each individual will have a unique set of such DNA 
sequences due to allelic differences. The sequences of the invention can be used to obtain 
such identification sequences from individuals and from tissue. The AAP sequences of 
the invention uniquely represent portions of an individual's genome. Allelic variation 
occurs to some degree in the coding regions of these sequences, and to a greater degree in 
the noncoding regions. The allelic variation between individual humans occurs with a 
frequency of about once ever 500 bases. Much of the allelic variation is due to single 
nucleotide polymorphisms (SNPs), which include RFLPs. 

Each of the sequences described herein can, to some degree, be used as a standard 
against which DNA from an individual can be compared for identification purposes. 
Because greater numbers of polymorphisms occur in noncoding regions, fewer sequences 
are necessary to differentiate individuals. Noncoding sequences can positively identify 
individuals with a panel of 10 to 1,000 primers that each yield a noncoding amplified 
sequence of 100 bases. If predicted coding sequences, such as those in SEQ ID NOS: 1, 3, 
5, 7, 9, 1 1, 13, and 15 are used, a more appropriate number of primers for positive 
individual identification would be 500-2,000. 

Predictive medicine 

The invention also pertains to the field of predictive medicine in which diagnostic 
assays, prognostic assays, pharmacogenomics, and monitoring clinical trials are used for 
prognostic (predictive) purposes to treat an individual prophylactically. Accordingly, one 
aspect of the invention relates to diagnostic assays for determining an AAP and/or nucleic 
acid expression as well as AAP activity, in the context of a biological sample (e.g., blood, 
serum, cells, tissue) to determine whether an individual is afflicted with a disease or 
disorder, or is at risk of developing a disorder, associated with aberrant AAP expression 
or activity, including cancer. The invention also provides for prognostic (or predictive) 
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assays for determining whether an individual is at risk of developing a disorder associated 
with an AAP, nucleic acid expression or activity. For example, mutations in an AAP can 
be assayed in a biological sample. Such assays can be used for prognostic or predictive 
purpose to prophylactically treat an individual prior to the onset of a disorder 
characterized by or associated with the AAP, nucleic acid expression, or biological 
activity. 

Another aspect of the invention provides methods for determining AAP activity, 
or nucleic acid expression, in an individual to select appropriate therapeutic or 
prophylactic agents for that individual (referred to herein as "pharmacogenomics"). 
Pharmacogenomics allows for the selection of modalities (e.g., drugs, foods) for 
therapeutic or prophylactic treatment of an individual based on the individual's genotype 
(e.g., the individual's genotype to determine the individual's ability to respond to a 
particular agent). Another aspect of the invention pertains to monitoring the influence of 
modalities (e.g., drugs, foods) on the expression or activity of an AAP in clinical trials. 

1 . Diagnostic assays 

An exemplary method for detecting the presence or absence of an AAP in a 
biological sample involves obtaining a biological sample from a subject and contacting 
the biological sample with a compound or an agent capable of detecting the AAP or the 
AAP nucleic acid (e.g., mRNA, genomic DNA) such that the presence of the AAP is 
confirmed in the sample. An agent for detecting the AAP mRNA or genomic DNA is a 
labeled nucleic acid probe that can hybridize to the AAP mRNA or genomic DNA. The 
nucleic acid probe can be, for example, a full-length AAP nucleic acid, such as the nucleic 
acid of SEQ ID NOS: 1, 3, 5, 7, 9, 1 1, 13 or 15, or a portion thereof, such as an 
oligonucleotide of at least 15, 30, 50, 100, 250 or 500 nucleotides in length and sufficient 
to specifically hybridize under stringent conditions to an AAP mRNA or genomic DNA. 

An agent for detecting an AAP polypeptide is an antibody capable of binding to 
the AAP, preferably an antibody with a detectable label. Abs can be polyclonal, or more 
preferably, monoclonal. An intact antibody, or a fragment (e.g., F a b or F(ab')2) can be 
used A labeled probe or antibody is coupled (i.e., physically linking) to a detectable 
substance, as well as indirect detection of the probe or antibody by reactivity with another 
reagent that is directly labeled Examples of indirect labeling include detection of a 
primary antibody using a fluorescently-labeled secondary antibody and end-labeling of a 
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DNA probe with biotin such that it can be detected with fluorescently-labeled 
streptavidin. The term "biological sample" includes tissues, cells and biological fluids 
isolated from a subject, as well as tissues, cells and fluids present within a subject. The 
detection method of the invention can be used to detect an AAP mRNA, protein, or 
genomic DNA in a biological sample in vitro as well as in vivo. For example, in vitro 
techniques for detection of an AAP mRNA include Northern hybridizations and in situ 
hybridizations. In vitro techniques for detection of an AAP polypeptide include enzyme 
linked immunosorbent assays (ELISAs), Western blots, immunoprecipitations, and 
immunofluorescence. In vitro techniques for detection of an AAP genomic DNA include 
Southern hybridizations and fluorescence in situ hybridization (FISH). Furthermore, in 
vivo techniques for detecting an AAP include introducing into a subject a labeled anti- 
AAP antibody. For example, the antibody can be labeled with a radioactive marker 
whose presence and location in a subject can be detected by standard imaging techniques. 

In one embodiment, the biological sample from the subject contains protein 
molecules, and/or mRNA molecules, and/or genomic DNA molecules. A preferred 
biological sample is blood. 

In another embodiment, the methods further involve obtaining a biological sample 
from a subject to provide a control, contacting the sample with a compound or agent to 
detect an AAP, mRNA, or genomic DNA, and comparing the presence of the AAP, 
mRNA or genomic DNA in the control sample with the presence of the AAP, mRNA or 
genomic DNA in the test sample. 

The invention also encompasses kits for detecting an AAP in a biological sample. 
For example, the kit can comprise: a labeled compound or agent capable of detecting an 
AAP or an AAP mRNA in a sample; reagent and/or equipment for determining the 
amount of an AAP in the sample; and reagent and/or equipment for comparing the 
amount of an AAP in the sample with a standard. The compound or agent can be 
packaged in a suitable container. The kit can further comprise instructions for using the 
kit to detect the AAP or nucleic acid. 

2. Prognostic assays 

The diagnostic methods described herein can furthermore be utilized to identify 
subjects having or at risk of developing a disease or disorder associated with an aberrant 
AAP expression or activity. For example, the assays described herein, can be used to 
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identify a subject having or at risk of developing a disorder associated with AAP, nucleic 
acid expression or activity. Alternatively, the prognostic assays can be used to identify a 
subject having or at risk for developing a disease or disorder. Tthe invention provides a 
method for identifying a disease or disorder associated with an aberrant AAP expression 
or activity in which a test sample is obtained from a subject and the AAP or nucleic acid 
(e.g., mRNA, genomic DNA) is detected. A test sample is a biological sample obtained 
from a subject For example, a test sample can be a biological fluid (e.g., serum), cell 
sample, or tissue. 

Pognostic assays can be used to determine whether a subject can be administered 
a modality (e.g., an agonist, antagonist, peptidomimetic, protein, peptide, nucleic acid, 
small molecule, food, etc.) to treat a disease or disorder associated with an aberrant AAP 
expression or activity. Such methods can be used to determine whether a subject can be 
effectively treated with an agent for a disorder. The invention provides methods for 
determining whether a subject can be effectively treated with an agent for a disorder 
associated with an aberrant AAP expression or activity in which a test sample is obtained 
and the AAP or nucleic acid is detected (e.g., where the presence of the AAP or nucleic 
acid is diagnostic for a subject that can be administered the agent to treat a disorder 
associated with the aberrant AAP expression or activity). 

The methods of the invention can also be used to detect genetic lesions in an AAP 
to determine if a subject with the genetic lesion is at risk for a disorder characterized by 
aberrant angiogenesis. Methods include detecting, in a sample from the subject, the 
presence or absence of a genetic lesion characterized by at an alteration affecting the 
integrity of a gene encoding an AAP polypeptide, or the mis-expression of an AAP. Such 
genetic lesions can be detected by ascertaining: (1) a deletion of one or more nucleotides 
from an AAP; (2) an addition of one or more nucleotides to an AAP; (3) a substitution of 
one or more nucleotides in an AAP, (4) a chromosomal rearrangement of an AAP gene; 
(5) an alteration in the level of an AAP mRNA transcripts, (6) aberrant modification of an 
AAP, such as a change genomic DNA methylation, (7) the presence of a non-wild-type 
splicing pattern of an AAP mRNA transcript, (8) a non-wild-type level of an AAP, (9) 
allelic loss of an AAP, and/or (10) inappropriate post-translational modification of an 
AAP polypeptide. There are a large number of known assay techniques that can be used 
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to detect lesions in an AAP. Any biological sample containing nucleated cells may be 
used. 

In certain embodiments, lesion detection may use a probe/primer in a polymerase 
chain reaction (PCR) {e.g., (Mullis, US Patent No. 4,683,202, 1987; Mullis et al, US 
Patent No. 4,683,195, 1987), such as anchor PCR or rapid amplification of cDNA ends 
(RACE) PCR, or, alternatively, in a ligation chain reaction (LCR) (e.g., (Landegren et al., 
1988; Nakazawa et al, 1994), the latter is particularly useful for detecting point mutations 
in AAP-genes (Abravaya et al., 1995). This method may include collecting a sample 
from a patient, isolating nucleic acids from the sample, contacting the nucleic acids with 
one or more primers that specifically hybridize to an AAP under conditions such that 
hybridization and amplification of the AAP (if present) occurs, and detecting the presence 
or absence of an amplification product, or detecting the size of the amplification product 
and comparing the length to a control sample. It is anticipated that PCR and/or LCR may 
be desirable to use as a preliminary amplification step in conjunction with any of the 
techniques used for detecting mutations described herein. 

Alternative amplification methods include: self sustained sequence replication 
(Guatelli et al, 1990), transcriptional amplification system (Kwoh et al, 1989); Q|3 
Replicase (Lizardi et al, 1988), or any other nucleic acid amplification method, followed 
by the detection of the amplified molecules using techniques well known to those of skill 
in the art. These detection schemes are especially useful for the detection of nucleic acid 
molecules present in low abundance. 

Mutations in an AAP from a sample can be identified by alterations in restriction 
enzyme cleavage patterns. For example, sample and control DNA is isolated, amplified 
(optionally), digested with one or more restriction endonucleases, and fragment length 
sizes are determined by gel electrophoresis and compared. Differences in fragment 
length sizes between sample and control DNA indicates mutations in the sample DNA. 
Moreover, the use of sequence specific ribozymes can be used to score for the presence of 
specific mutations by development or loss of a ribozyme cleavage site. 

Hybridizing a sample and control nucleic acids, e.g., DNA or RNA, to high- 
density arrays containing hundreds or thousands of oligonucleotides probes, can identify 
genetic mutations in an AAP (Cronin et al, 1996; Kozal et al, 1996). For example, 
genetic mutations in an AAP can be identified in two-dimensional arrays containing 



- 118- 



light-generated DNA probes as described in Cronin, et al, supra. Briefly, a first 
hybridization array of probes can be used to scan through long stretches of DNA in a 
sample and control to identify base changes between the sequences by making linear 
arrays of sequential overlapping probes. This step allows the identification of point 
mutations. This is followed by a second hybridization array that allows the 
characterization of specific mutations by using smaller, specialized probe arrays 
complementary to all variants or mutations detected. Each mutation array is composed of 
parallel probe sets, one complementary to the wild-type gene and the other 
complementary to the mutant gene. 

In yet another embodiment, any of a variety of sequencing reactions known in the 
art can be used to directly sequence an AAP and detect mutations by comparing the 
sequence of the sample AAP-with the corresponding wild-type (control) sequence. 
Examples of sequencing reactions include those based on classic techniques (Maxam and 
Gilbert, 1977; Sanger et al, 1977). Any of a variety of automated sequencing procedures 
can be used when performing diagnostic assays (Naeve et al, 1995) including sequencing 
by mass spectrometry (Cohen et al., 1996; Griffin and Griffin, 1993; Koster, 
WO94/16101, 1994). 

Other methods for detecting mutations in an AAP include those in which 
protection from cleavage agents is used to detect mismatched bases in RNA/RNA or 
RNA/DNA heteroduplexes (Myers et al, 1985). In general, the technique of "mismatch 
cleavage" starts by providing heteroduplexes formed by hybridizing (labeled) RNA or 
DNA containing a wild-type AAP sequence with potentially mutant RNA or DNA 
obtained from a sample. The double-stranded duplexes are treated with an agent that 
cleaves single-stranded regions of the duplex such as those that arise from base pair 
mismatches between the control and sample strands. For instance, RNA/DNA duplexes 
can be treated with RNase and DNA/DNA hybrids treated with Si nuclease to 
enzymatically digest the mismatched regions. In other embodiments, either DNA/DNA 
or RNA/DNA duplexes can be treated with hydroxylamine or osmium tetroxide and with 
piperidine in order to digest mismatched regions. The digested material is then separated 
by size on denaturing polyacryl amide gels to determine the mutation site (Grompe et ai, 
1989; Saleeba and Cotton, 1993). The control DNA or RNA can be labeled for detection. 
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Mismatch cleavage reactions may employ one or more proteins that recognize 
mismatched base pairs in double-stranded DNA (DNA mismatch repair) in defined 
systems for detecting and mapping point mutations in an AAP cDNAs obtained from 
samples of cells. For example, the mutY enzyme of E. coli cleaves A at G/A mismatches 
5 and the thymidine DNA glycosylase from HeLa cells cleaves T at G/T mismatches (Hsu 

et aL, 1994). According to an exemplary embodiment, a probe based on a wild-type AAP 
sequence is hybridized to a cDNA or other DNA product from a test cell(s). The duplex 
is treated with a DNA mismatch repair enzyme, and the cleavage products, if any, can be 
detected from electrophoresis protocols or the like (Modrich et al., US Patent No. 
10 0 5,459,039,1995). 

Electrophoretic mobility alterations can be used to identify mutations in an AAP. 
For example, single strand conformation polymorphism (SSCP) may be used to detect 
differences in electrophoretic mobility between mutant and wild type nucleic acids 
~ (Cotton, 1993; Hayashi, 1992; Orita et aL, 1989). Single-stranded DNA fragments of 
1 5 ;H sample and control AAP nucleic acids are denatured and then renatured The secondary 
ill structure of single-stranded nucleic acids varies according to sequence; the resulting 
\f\ alteration in electrophoretic mobility allows detection of even a single base change. The 
! " = DNA fragments may be labeled or detected with labeled probes. The sensitivity of the 
assay may be enhanced by using RNA (rather than DNA), in which the secondary 
20 structure is more sensitive to a sequence changes. The subject method may use 

heteroduplex analysis to separate double stranded heteroduplex molecules on the basis of 
changes in electrophoretic mobility (Keen et aL, 1991). 

The migration of mutant or wild-type fragments can be assayed using denaturing 
gradient gel electrophoresis (DGGE; (Myers et al, 1985). In DGGE, DNA is modified to 
25 prevent complete denaturation, for example by adding a GC clamp of approximately 40 

bp of high-melting GC-rich DNA by PCR. A temperature gradient may also be used in 
place of a denaturing gradient to identify differences in the mobility of control and sample 
DNA (Rossi ter and Caskey, 1990). 

Examples of other techniques for detecting point mutations include, but are not 
30 limited to, selective oligonucleotide hybridization, selective amplification, or selective 

primer extension. For example, oligonucleotide primers may be prepared in which the 
known mutation is placed centrally and then hybridized to target DNA under conditions 
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that permit hybridization only if a perfect match is found (Saiki et al., 1986; Saiki et aL, 
1989). Such allele-specific oligonucleotides are hybridized to PCR-amplified target DNA 
or a number of different mutations when the oligonucleotides are attached to the 
hybridizing membrane and hybridized with labeled target DNA. 

Alternatively, allele specific amplification technology that depends on selective 
PCR amplification may be used. Oligonucleotide primers for specific amplifications may 
carry the mutation of interest in the center of the molecule (so that amplification depends 
on differential hybridization (Gibbs et ai, 1989)) or at the extreme S'-terminus of one 
primer where, under appropriate conditions, mismatch can prevent, or reduce polymerase 
extension (Prosser, 1993). Novel restriction site in the region of the mutation may be 
introduced to create cleavage-based detection (Gasparini et aL, 1992). Certain 
amplification may also be performed using Taq ligase for amplification (Barany, 1991). 
In such cases, ligation occurs only if there is a perfect match at the 3-terminus of the 5' 
sequence, allowing detection of a known mutation by scoring for amplification. 

The described methods may be performed, for example, by using pre-packaged 
kits comprising at least one probe (nucleic acid or antibody) that may be conveniently 
used, for example, in clinical settings to diagnose patients exhibiting symptoms or family 
history of a disease or illness involving an AAP. 

Furthermore, any cell type or tissue in which an AAP is expressed may be utilized 
in the prognostic assays described herein. 

3. Pharmacogenomics 

Agents, or modulators that have a stimulatory or inhibitory effect on AAP activity 
or expression, as identified by a screening assay can be administered to individuals to 
treat, prophylactically or therapeutically, disorders, including those associated with 
angiogenesis. In conjunction with such treatment, the pharmacogenomics (i.e., the study 
of the relationship between a subject's genotype and the subject's response to a foreign 
modality, such as a food, compound or drug) may be considered Metabolic differences 
of therapeutics can lead to severe toxicity or therapeutic failure by altering the relation 
between dose and blood concentration of the pharmacologically active drug. Thus, the 
pharmacogenomics of the individual permits the selection of effective agents (e.g., drugs) 
for prophylactic or therapeutic treatments based on a consideration of the individual's 
genotype. Pharmacogenomics can further be used to determine appropriate dosages and 
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therapeutic regimens. Accordingly, the activity of an AAP, expression of an AAP nucleic 
acid, or an AAP mutation(s) in an individual can be determined to guide the selection of 
appropriate agent(s) for therapeutic or prophylactic treatment. 

Pharmacogenomics deals with clinically significant hereditary variations in the 
response to modalities due to altered modality disposition and abnormal action in affected 
persons (Eichelbaum and Evert, 1996; Linder et aL y 1997). In general, two 
pharmacogenetic conditions can be differentiated: (1) genetic conditions transmitted as a 
single factor altering the interaction of a modality with the body (altered drug action) or 
(2) genetic conditions transmitted as single factors altering the way the body acts on a 
modality (altered drug metabolism). These pharmacogenetic conditions can occur either 
as rare defects or as nucleic acid polymorphisms. For example, glucose-6-phosphate 
dehydrogenase (G6PD) deficiency is a common inherited enzymopathy in which the main 
clinical complication is hemolysis after ingestion of oxidant drugs (anti-malarials, 
sulfonamides, analgesics, nitrofurans) and consumption of fava beans. 

As an illustrative embodiment, the activity of drug metabolizing enzymes is a 
major determinant of both the intensity and duration of drug action. The discovery of 
genetic polymorphisms of drug metabolizing enzymes (e.g., N-acetyltransferase 2 (NAT 
2) and cytochrome P450 enzymes CYP2D6 and CYP2C19) explains the phenomena of 
some patients who show exaggerated drug response and/or serious toxicity after taking 
the standard and safe dose of a drug. These polymorphisms are expressed in two 
phenotypes in the population, the extensive metabolizer (EM) and poor metabolizer (PM). 
The prevalence of PM is different among different populations. For example, the 
CYP2D6 gene is highly polymorphic and several mutations have been identified in PM, 
which all lead to the absence of functional CYP2D6. Poor metabolizers due to mutant 
CYP2D6 and CYP2C19 frequently experience exaggerated drug responses and side 
effects when they receive standard doses. If a metabolite is the active therapeutic moiety, 
PM shows no therapeutic response, as demonstrated for the analgesic effect of codeine 
mediated by its CYP2D6-formed metabolite morphine. At the other extreme are the so- 
called ultra-rapid metabolizers who are unresponsive to standard doses. Recently, the 
molecular basis of ultra-rapid metabolism has been identified to be due to CYP2D6 gene 
amplification. 
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The activity of an AAP, expression of an AAP nucleic acid, or mutation content of 
an AAP in an individual can be determined to select appropriate agent(s) for therapeutic 
or prophylactic treatment of the individual. In addition, pharmacogenetic studies can be 
used to apply genotyping of polymorphic alleles encoding drug-metabolizing enzymes to 
the identification of an individual's drug responsiveness phenotype. This knowledge, 
when applied to dosing or drug selection, can avoid adverse reactions or therapeutic 
failure and thus enhance therapeutic or prophylactic efficiency when treating a subject 
with an AAP modulator, such as a modulator identified by one of the described 
exemplary screening assays. 

4. Monitoring effects during clinical trials 

Monitoring the influence of agents (e.g., drugs, compounds) on the expression or 
activity of an AAP (e.g., the ability to modulate angiogenesis) can be applied not only in 
basic drug screening, but also in clinical trials. For example, the effectiveness of an agent 
determined by a screening assay to increase an AAP expression, protein levels, or up- 
regulate an AAP's activity can be monitored in clinical trails of subjects exhibiting 
decreased AAP expression, protein levels, or down-regulated AAP activity. Alternatively, 
the effectiveness of an agent determined to decrease an AAP expression, protein levels, or 
down-regulate an AAP's activity, can be monitored in clinical trails of subjects exhibiting 
increased the AAP expression, protein levels, or up-regulated AAP activity. In such 
clinical trials, the expression or activity of the AAP and, preferably, other genes that have 
been implicated in, for example, angiogenesis can be used as a "read out" or markers for a 
particular cell's responsiveness. 

For example, genes, including an AAP, that are modulated in cells by treatment 
with a modality (e.g., food, compound, drug or small molecule) can be identified To 
study the effect of agents on angiogenesis, for example, in a clinical trial, cells can be 
isolated and RNA prepared and analyzed for the levels of expression of an AAP and other 
genes implicated in the disorder. The gene expression pattern can be quantified by 
Northern blot analysis, nuclear run-on or RT-PCR experiments, or by measuring the 
amount of protein, or by measuring the activity level of the AAP or other gene products. 
In this manner, the gene expression pattern itself can serve as a marker, indicative of the 
cellular physiological response to the agent. Accordingly, this response state may be 
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determined before, and at various points during, treatment of the individual with the 
agent. 

The invention provides a method for monitoring the effectiveness of treatment of 
a subject with an agent (e.g., an agonist, antagonist, protein, peptide, peptidomimetic, 
nucleic acid, small molecule, food or other drug candidate identified by the screening 
assays described herein) comprising the steps of (1) obtaining a pre-administration 
sample from a subject; (2) detecting the level of expression of an AAP, mRNA, or 
genomic DNA in the preadministration sample; (3) obtaining one or more post- 
administration samples from the subject; (4) detecting the level of expression or activity 
of the AAP, mRNA, or genomic DNA in the post-administration samples; (5) comparing 
the level of expression or activity of the AAP, mRNA, or genomic DNA in the pre- 
administration sample with the AAP, mRNA, or genomic DNA in the post administration 
sample or samples; and (6) altering the administration of the agent to the subject 
accordingly. For example, increased administration of the agent may be desirable to 
increase the expression or activity of the AAP to higher levels than detected, i.e., to 
increase the effectiveness of the agent. Alternatively, decreased administration of the 
agent may be desirable to decrease expression or activity of the AAP to lower levels than 
detected, i.e., to decrease the effectiveness of the agent. 

5. Methods of treatment 

The invention provides for both prophylactic and therapeutic methods of treating a 
subject at risk of (or susceptible to) a disorder or having a disorder associated with 
aberrant AAP expression or activity. Furthermore, these same methods of treatment may 
be used to induce or inhibit angiogenesis, by changing the level of expression or activity 
of an AAP. 

6. Disease and disorders 

Diseases and disorders that are characterized by increased AAP levels or 
biological activity may be treated with therapeutics that antagonize (i.e., reduce or inhibit) 
activity. Antognists may be administered in a therapeutic or prophylactic manner. 
Therapeutics that may be used include: (1) AAP peptides, or analogs, derivatives, 
fragments or homologs thereof; (2) Abs to an AAP peptide; (3) AAP nucleic acids; (4) 
administration of antisense nucleic acid and nucleic acids that are "dysfunctional" (i.e., 
due to a heterologous insertion within the coding sequences) that are used to eliminate 
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endogenous function of by homologous recombination (Capecchi, 1989); or (5) 
modulators (i.e., inhibitors, agonists and antagonists, including additional peptide mimetic 
of the invention or Abs specific to an AAP) that alter the interaction between an AAP and 
its binding partner. 

Diseases and disorders that are characterized by decreased AAP levels or 
biological activity may be treated with therapeutics that increase (i.e., are agonists to) 
activity. Therapeutics that upregulate activity may be administered therapeutically or 
prophylactically. Therapeutics that may be used include peptides, or analogs, derivatives, 
fragments or homologs thereof; or an agonist that increases bioavailability. 

Increased or decreased levels can be readily detected by quantifying peptide 
and/or RNA, by obtaining a patient tissue sample (e.g., from biopsy tissue) and assaying 
in vitro for RNA or peptide levels, structure and/or activity of the expressed peptides (or 
AAP mRNAs). Methods include, but are not limited to, immunoassays (e.g., by Western 
blot analysis, immunoprecipitation followed by sodium dodecyl sulfate (SDS) 
polyacryl amide gel electrophoresis, immunocytochemistry, etc.) and/or hybridization 
assays to detect expression of mRNAs (e.g., Northern assays, dot blots, in situ 
hybridization, and the like). 

7 . Prophylactic methods 

The invention provides a method for preventing, in a subject, a disease or 
condition associated with an aberrant AAP expression or activity, by administering an 
agent that modulates an AAP expression or at least one AAP activity. Subjects at risk for 
a disease that is caused or contributed to by an aberrant AAP expression or activity can be 
identified by, for example, any or a combination of diagnostic or prognostic assays. 
Administration of a prophylactic agent can occur prior to the manifestation of symptoms 
characteristic of the AAP aberrancy, such that a disease or disorder is prevented or, 
alternatively, delayed in its progression. Depending on the type of AAP aberrancy, for 
example, an AAP agonist or AAP antagonist can be used to treat the subject. The 
appropriate agent can be determined based on screening assays. 

8. Therapeutic methods 

Another aspect of the invention pertains to methods of modulating an AAP 
expression or activity for therapeutic purposes. The modulatory method of the invention 
involves contacting a cell with an agent that modulates one or more of the activities of 
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AAP activity associated with the cell. An agent that modulates AAP activity can be a 
nucleic acid or a protein, a naturally occurring cognate ligand of an AAP, a peptide, an 
AAP peptidomimetic, or other small molecule. The agent may stimulate AAP activity. 
Examples of such stimulatory agents include an active AAP and an AAP nucleic acid 
5 molecule that has been introduced into the cell. In another embodiment, the agent inhibits 

AAP activity. Examples of inhibitory agents include antisense AAP nucleic acids and 
anti-AAP Abs. Modulatory methods can be performed in vitro (e.g., by culturing the cell 
with the agent) or, alternatively, in vivo (e.g., by administering the agent to a subject). As 
^ such, the invention provides methods of treating an individual afflicted with a disease or 
1 0 0 disorder characterized by aberrant expression or activity of an AAP or nucleic acid 

i=L molecule. In one embodiment, the method involves administering an agent (e.g., an agent 
\ 4 \ identified by a screening assay), or combination of agents that modulates (e.g., up- 
:% J regulates or down-regulates) AAP expression or activity. In another embodiment, the 
~" method involves administering an AAP or nucleic acid molecule as therapy to 
1 5 l r\ compensate for reduced or aberrant AAP expression or activity. 

I II Stimulation of AAP activity is desirable in situations in which AAP is abnormally 

: A down-regulated and/or in which increased AAP activity is likely to have a beneficial 
^ effect. One example of such a situation is where a subject has a disorder characterized by 
aberrant angiogenesis (e.g., cancer). 
20 ( 9. Determination of the biological effect of the therapeutic 

Suitable in vitro or in vivo assays can be performed to determine the effect of a 
specific therapeutic and whether its administration is indicated for treatment of the 
affected tissue. 

In various specific embodiments, in vitro assays may be performed with 
25 representative cells of the type(s) involved in the patient's disorder, to determine if a 

given therapeutic exerts the desired effect upon the cell type(s). Modalities for use in 
therapy may be tested in suitable animal model systems including, but not limited to rats, 
mice, chicken, cows, monkeys, rabbits, and the like, prior to testing in human subjects. 
Similarly, for in vivo testing, any of the animal model system known in the art may be 
30 used prior to administration to human subjects. 

10. Prophylactic and therapeutic uses of the compositions of the invention 
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AAP nucleic acids and proteins are useful in potential prophylactic and 
therapeutic applications implicated in a variety of disorders including, but not limited to 
those related to angiogenesis. 

As an example, a cDNA encoding an AAP may be useful in gene therapy, and the 
protein may be useful when administered to a subject in need thereof. 

AAP nucleic acids, or fragments thereof, may also be useful in diagnostic 
applications, wherein the presence or amount of the nucleic acid or the protein is to be 
assessed. A further use could be as an an ti -bacterial molecule (i.e., some peptides have 
been found to possess anti -bacterial properties). These materials are further useful in the 
generation of Abs that immunospecifically bind to the novel substances of the invention 
for use in therapeutic or diagnostic methods. 

The following example is meant to not be limiting. 

Example 

Identification of genes differentially-regulated 

A comprehensive mRNA profiling technique (GeneCalling) was used to 
determine differential gene expression profiles of human endothelial cells undergoing 
differentiation into tube-like structures (Kahn et al 9 2000). To confirm the expression 
data from GeneCalling, independent experiments were undertaken that used gene-specific 
PCR oligonucleotide primer pairs and an oligonucleotide probe labeled with a fluorescent 
dye at the 5' end and quencher fluorescent dye at the 3' end.. Total RNA (50 ng) was 
added to a 50 (il RT-PCR mixture and run. 
The following data were collected: 



hEF G collagen gel 24 hr versus 4h 4.5 fold upregulated 

hTRG collagen gel 24 hr versus 4h 3.5 fold upregulated 

KLP collagen gel 24 hr versus 4h 3.5 fold upregulated 

myosin X collagen gel 24 hr versus 4h 3.5 fold upregulated 

NHR collagen gel 24 hr versus 4h 7.3 fold downregulated 

HBAZF collagen gel 24 hr versus 4h 2.1 fold upregulated 



Equivalents 



- 127- 



Although particular embodiments have been disclosed herein in detail, this has 
been done by way of example for purposes of illustration only, and is not intended to be 
limiting with respect to the scope of the appended claims that follow. In particular, it is 
contemplated by the inventors that various substitutions, alterations, and modifications 
5 may be made to the invention without departing from the spirit and scope of the invention 

as defined by the claims. The choice of nucleic acid starting material, clone of interest, or 
library type is believed to be a matter of routine for a person of ordinary skill in the art 
with knowledge of the embodiments described herein. Other aspects, advantages, and 
modifications considered within the scope of the following claims. 



10 
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